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EXECUTIVE SUMMARY 



workshop entitled "The Subject Access Problem 
Opportunities for Solution" was held in Springfield, Virginia, ^ 
October 18-20,. 1978./ The workshop was sponsored by the Committee 
for the Coordination of ^National Bibliographic Control which is 
supported by the National Science Foundation, the National 
Commission on Libraries and Information Science, and the Cduncil 
on Library Resources. Funds for^ the conduct of the workshop were 
provided by the Committee with additional financial support 
provided by the National Endowment f or ' the Humanities and the 
Council on Library Resources. 

The need to improve the ability of the user to find needed 
information through -subject access was the basis for the 
workshop. The Committee organized the workshop to: 

. highlight: the role of subject access with the 
emphasis on the needs of the information seeker, 

. compare techniques for subject access '^now being used 
in the, library and the abstracting and ind.exing (ASci) 
communities including applicable research efforts, 

. emphasize the role and nature of authority files in 
subject access"* control and facilitation, 

. s eek opportunities to ( 1 ) improve information 
preparation for subject access., through better 
functional interconnection and integration across all 
communities and (2) promote the use and enhance 
techniques for subject access to classe£3 of 
publications not now adequately covered. 

■ The planning committee assumed the responsibility for - 
synthesizing the recommendations from the deliberations of the 
workshop. While endeavoring to express the themes which were 
most prevalent, the planning committee does .not wish to imply 
that the highlights which are presented in the- chapter "Synthesis 
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aj>d . Action Recommendations'' are a formal consensus of the 

participants. Major themes which stood out strongly during tl, ^ 

workshop were;* ^ . ^ 

. diversity in subject access needs and approaches is 

^eal, is good, and will continue, 
. subject access mechanisms should be designed for end 
useri^ and not exclusively for informatior* 
^ professionals, 
« "post-^coordinate" natural language subject approaches 
for retrieval are likely to become more emphasized 
than traditional pre-coordinate subject heading 
systems to enable 'easier adaptation to changing 
^ interests and needs, 

. groups haying common interests for serving specific 
subject aiccess needs should unite for action; these 
, . groups 'must - include all relevant aspects of 

libraries^ publishers, abstracting and indexing 
organizations, and other information agencies., 
. some areas (e.g. , the humanities) , that are now 
deficient in subject ^access can learn from the more 
well developed areas but must adopt techniques 
selectively and knowingly, 
. universal standardization should not itself be a 
goal; in many situations interlinking of the various 
communities of ^ interest (e.g^, with respect to 
vocabularies) is more practical and acceptable, 
* a single national plan for subject access with 
centralized funding is not feasible in th?s 
pluralistic society and economy of the United States. 

The planning committee recommended certain specific actions 
that should be taken and pointed out that each recommendation 
requires organizing, funding, and ' a set of individuals to carry 
out the work. Those who are capable of providing the necessary 
initiative and support in fur*-herance of the overall objectives 
are not identi f ied in this paper, however, the" specifics of this 



paper have been brought, ^to their attention.. The summary of 
action recommendations follows: 

1. Interlinking of library- and A&I- produced subject- 
access tools 

a. Locate action agents in each of the library and A&I 
communities who are ' willing to collaborate in the 
development of demonstration projects in limited 

^ subject areas of the sciences that would focus on how 

to solve subject access problems in an integrater? 
environment. ' 

b. In a similar manner, carry out a demonstration 
project on the linkage between formal catalogs and 
access tools to manuscripts and other forms of 
information in ' a specific area of interest to the 
humanities. 

c. Additional efforts are needed to broaden awareness 
and understanding of the various kinds of subject 
access techniques and tools through: 

1) joint sessions and/or- special conferences 
' sponsored by professional associations; 

2 ) publications I including up-to-date textbooks and 
curriculum materials, that emphasize th'fe 
integration of e>/ailable resources. 

2. A "system view" oj" future sub ject-access ' mechanisms 

a. Conduct a research project on the design 
characteristics of a large dynamic file system based 
on file usage data from the users and from collection 
change and growth to examine 

1) use patterns and user behavior, 

2) collection purging criteria and phenomena, ^ 

3 ) user or use-created cross-reference systems . 

3. Gaps in subject access tools for the general* population 
a. Make and disseminate an analysis of the availal)ility 

and suitability^ of subject access .approaches and 
topis for users of school and public libraries 
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. Subject access for the humanities • ^ ■ \. " 
a. Conduct a- study to determine 

. the nature of . humanities literature ^ , ' 

2) how humanist? use information, ' 

3) the design of 

Character! ti . Wropriate to the 

Character i:.tics of the literature ' 
b. Develop the capability tn • ' ' 

evaluate techniques i. ^f^- 
apply relevant techniques. — '° ''"^^ 



Subject access to monographs 
a. Encourage and support projects that offer innnvV- 
-.. and practical solutions- to the orohV 

subject access t-o Problem of in-depth 

auujecT: access to monographs. 

t.. conduct an experime -.tal research prole^t to ,^ ^ u 
generation of index . niatW" f 

m..ohine-r«dabXe for. as a I " .^°"°3»P>^= . . 

controlled co;po.ition. "^^P-^"" of co„pnt.^ ^ 

Subject access authority "files 

a. ln-.lude the means, for ' linMng lib^^^ , ■ 
produced subject-access toBls b • " 

- participation of *oth communities in ^he d ""'"^ ' 

Of a rational authority system for thl ibl ' 
service Development Program. B^Wiogriph.c 

b. AS a related effort, assemble a group, of experts to 
study the feasibility of, defining codes of 

wher. _ ^^^^^ construction aL'use o" 

each sub.ect-access subsystem of the ^ 
authority system. ^ overall 
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INTRODUCTION , 



The d)iniTiittee for the Cooraination of National Bibliographic 
Control (CCNBC) was established to pursue spe ific, comiron 
problems of concern to all ''sectors of the information community. 
In its efforts to enhance bibliographic control those 
principles ■ and processes, by which bibliographic 4'eems are 
identified to the level required for the management of and 
intellectual access to information of .all types — the Committee 
has- relied upon the cooperation those who publish., process, 
store, and disseminate" information in all of .its forms. 

The Committee ,has done xts work ^V'ro.igh a variety of means, 
including chartering specific studies, reviewing various projects 
and activities, advising its sponsors, and -organizing workshops . 
In this, case, irecognizing that c->^ject access to inforroatior, is a 
diverse set of interests and activities and that it is tiuiely to 
focus more attention on' the status, problems, and devel opment.3 
relating to subject access, the Committee decided to hold a 
workshop, to explore, these matters and appointed a , planning 
con-.iittee to organize and .conduct it. 

Participants were invited from a wide variety of backgrounds 
and environments; a complete List is given i>^ an appendix to this 
report'. Support „ for conduct ing the workshc,')? was obtained from 
the Natione.1 Endowment for tV.e Humanities ,~the.^Council on( Library 
Resources, and from Committer funds.. 

This report represents the completion of the CCNBC project 
to focus attention on subject access. However, this report alone 
^is not sufficient t^o accelerate tKe actual solution of subject 
"^a^ccesa 'proM ems ' facing information ^sers . Organizations that 
have' the m...- .as to caiase things to hai^pen through their funding or 
• their operating plans must pick up these themes and bring about 
real and beneficial changes. 
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BACKGROUND 



. Of the several modes of retrieving information, the subject 
approach holds the' greatest and most, dynamic potential for 
serving information seekers, yet it . is the least,, understood, the' 
least s-tandardized, the least developed, - and, often, the most 
costly process. Subject-access systems may be characterised as 
customized, oriented by discipline, based- on historical 
■ precedenl^ and, usually, constrained by cbllection siz^ and 
financial , considerations. The scope _ of - "subject access- 

encompasses, th^ concerns of libraries and 'their subject catalogs,: 
of abstracting and i^exing services and their publications and 
services,^ of archivists and their inventories, and of publishers 
and their catalogs. Practices .in these fields have developed 
independently, with the result that there is little commonality' 
or compatibility. ' ' " 

0n6 of the several uses of the catalog in libraries is its. 
employment as a Jiool for sub ject . access — to locate soijrces cf 
information about a particular subject. The primary emphasis of 
^ card catalogs is on monographs, ^n the other hand, access to the 
" subject content of articles within serials and the report 
literature has been emphasized by abstracting and indexing 
publications and services. In both cases other fbrms of 
publications are represented, .^but io each case the nature of the 
document determines the approaches anc3 techniques- that ?re 
utilized. • " 

ALthough subject cataloging is a time-honored ' speciality of 
library science and although subject indexing is the foundation 
of abstracting and indexing , services, there is insufficient 
collaborati^on and integration of efforts toward sub ject . access ' 
among the publishing, abstracting and indexing, and library 
communities. And yet, from the information seeker's point of 
view, techniques and mechanisms cf subject access are at the 
heart of information retrieval. The need to improve the abiiijty 
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of the user to find needed information through subject access was 
the basis for this '^rarkshop. 

The topic is timely: card catalogs are *^being phased out 
and their function .'must be replaced; book catalogs, COM catalogs, 
and online access are emerging; and the proliferation of sys^tems 
for subject access is becoming more obvious to users attempting 
to approach a variety of dissimilar data bases. 

In view of these problems, the Commitee tor the Coordination 
of National Bibliogriaphic Control organized a workshop £0: 

highlight the role of subject access in library and 
' information systems, with emphasis on the needs of 

the information seeker; , ' . ^ 

. assess the nature and extent to which there is a 
subject-access problem and identify related unsolved 
* problems of the field; 

. compare the techniques for subject access now being 
used, including the related research emphasis and 
results in the library and tihe abstracting and 
indexing communities, and seek to improve 
communications between the two c'ommunities with 
regard to subject-access reisearch efforts; 

. emphasize the specific role and nature of. authority 
files in subject-access control and facilitation; 

• seek opportunities to (1). improve information/ 
preparation for subject access to aLl classes of 
publications through ' better functional 

interconnection ariv integration across publishing, 
abstracting and indexing, . and library operations and 
(2) promote the use and enhancement o5 techniques for 
increased subject access to classes of publications 
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not now adequately covered (e.g., monographs and 
audiovisuals) . 

As a result of the traditions of library cataloging and the 
current impracticality of doing anything more comprehensive, 
conventional subject cataloging is conservative in the number of 
* access points and ^ their uses, employing the fewest headings 
needed to cover the focus of a work without providing headings 
for more specific topics included in the work. While most 
cur. >nt cataloging systems are capable theoretically of 
specificity, various practical considerations often limit the 
provision of highly specific headings unless a work itself deals 
only with a specific topic. In-depth cataloging is lacking for 
general works. This is exemplified in serials, where libraries 
have largely confined their cataloging effotts to the title level 
and provide no basis for detailed subject access. 

Meanwhile, abstracting and indexing services and information 
dissemination services are providing subject, access to various 
classes of publications. Some are aimed at maintaining 

consistency with library subject systems, for example the H. W. 
Wilson Company's^ Humanities Index, Social Science Index, 
Applied Science and Technology Index , Gtc. Others have evolved 
into independent, detailed subject-access tools, such as the 
subject indexes of the abstracting and indexing services, and 
have interface only at the title (cataloging) level. Yet others 
have developed a document delivery function, for example, the 
Educational Resources Information Center (ERIC) and its 
companion, the ERIC Document Reproduction Service (EDRS),. The 
document . delivery function has recently received considerable 
attention, including the interest of the National Commission on 
Libraries and Information Science (NCLIS), the National Technical 
Information Service (NTIS) (although its Journal Arti^cle Copy 
Service - [ JACS] was recently discontinued), ^ the National 
Commission^ on New Technological Uses of Copyrighted Works 
(CONTU), and others. Another difficult matter to be cqnsidered 
is that libraries and indexing and abstracting services have 
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avoided identifying content for particular audiences. For 
example, the treatment of a subject will be much different 
depending upon whether it is aimed at scholars, the general, adult 
population, or elementary school children. 

Two examples of integration of the library wil-h computer- 
based search access functions into a single organization are the 
National Library of Medicine and its MEDLARS and the National 
Library of Canada and its CANOLE. However, even in these cases 
the approach is based on ccmmon management of two kinds of 
activity and does not represent a true, funcjLional integration. 

\ 

A major factor in traditional library pi^^i-tice that affects 
sub ject- access as well as collection management is the growing 
difficulty in constructing and maintaining large and 
comprehensive card catalogs. This is reflected in discussions 
about and, in some cases, plans for replacing card catalogs with, 
for example, computer-based access- systems. Some totally 
computer-based systems for library management and information 
retrieval, such as BALLOTS, have included elements • of 
computerized text search functions iri their original design. 

With the incret^sing computerization of the information 
needed for management of and access to information . cdllecticns , 
there is a concurrent requirement to bring these two functions 
into harmony. There is also an opportunity to -intro'duce new 
methods: It is essential that collective expertise be brought to 
^ bear on solving these problems. 

The participants in this workshop included subject-access 
users from various disciplines as well, as providers of 
information reference librarians, i formation specialists, 

systems designers, and library and information systems managers. 
The workshop concentrated on significant technical issues and 
approaches as well . as on system design and operating management 
concerns . 

A list of questions was prepared and used to stimulate the 
discussions: 
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How can we eliminate the expensive and time-conQuming 
duplication of intellectual processes in creating 
subject access to bibliographic entities? 

What are the "elementally particles" of the subject- 
access universe: subject headings, index terms, key 
words, descriptions, identifiers, links, roles, 
classes, text, codes, etc.? What are their 

differences and similarities? 

What is the impact, or potential impact, on the input 
process ( including cataloging, indexing and 
abstracting) relative to text searching capabilities 
in information retrieval systems? 

What practices for -indexing that have been developed 
for specialized use in various* disciplines should be 
more widely adopted for general- use? 

'Can subject access be provided as effectively for 
monographs as _ f or items like journal articles and 
technical reports dealing with more specific subject 
areas? 

> 

How can libraries effectively provide users with 
subject access to tnt collections through data base 
search services? 

What is the role of present standards that are- 
applicable to subject access? How effective are 
they? Are new standards needed? 

As files get larger, must subject access necessarily 
become more complex and expensive? Should files be 
partitioned ~- for example, by date, document type, 
gross categories, etc., — as they get larger? 



WORKSHOP DESIGN 



TOe workshop was limited to thirty--five Invited 
participants, plus seven observers* It consisted of six sessions 
beginning 'with an evening session and followed by two full days. 
A broad mix of participants was invited in order to obtain a full 
range of points of view and experience. 

The workshop used an analytic approach. A set of questions 
was posed: What is the subject-access problem? How is subject 
access handled? What are., the related questions that lead to 
problems? The special characteristics of ^ each problem were 
considered, wirl; full atjbention given to differences arising from 
such things as discipline or subject area, type of material or 
medium, etc. The second part of the analytic approach was to 
discuss the alleviation of some of the problems th^v had been 
identified in the first half of the program and theu attempt 
to Jidentify prospects for working cpn them cooperatively. 

One of the objectives . of the introductory evening session 
with the keynote, speaker was to. place "Subject Access in 
Perspective" as one of many ways of accessing information. Other 
objectives were to (1) start the thinking that led to answers to 
such questions as why, when, and how subject access is, used and 
for what purpose; (2) characterize the nature of subject access 
as distinguished from other ways of finding information in 
collections; and (3) emphasize ±he fact that subject access is 
lagging behind other means of access in terms of technique, 
degree of automation, and other qualities needed to provide good 
and complete service to information seekers • 

The first full day of the workshop started with a session on 
"Subject Access: Where are we now, and why haven't we gotten 
further?" Each member of a panel representing a mix of 
backgrounds stated briefly a point of view that wa::: related to 
the ' theme. General discussion by the ' workshop participants 
followed. 
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T'oTTdwing "tKalT, xti — th"e — a fter noon , — the - *;ark-shop — discu s-sed- 



"Current Research and Development in Subject Access During 
this session current activities that may contribute techniques 
and experience toward solutions for the future were reviewed and 
compared. Ideas and results from the diverse backgrounds and 
knowledge of the participants were shared to establish a broader 
awareness of work now being done in specialized areas. Existing 
work on intercommunity interfaces, lirdts of technology in 
achieving various solutions, and related cost factors were 
included in this session. 

( The evening of the first full day featured "Subject Acce.«:s: 

Challenges for the" Future." The purpose of the formal 
presentation was to stimulate the participants* thinking in order 
to begin the transition from the problem identification and 
current activity nature of the earlier sessions to analysis and 
problem solving. 

The morning of the second day was devoted to "Subject 
Access: Key Problem Areas and Issues," in which the discussion 
was based upon a list of problem areas, issues, and questions 
developed fron the earlier sessions. 

An extended noon period provided for lunch in small groups 
to which participants were assigned. Informal discussions 
generated recommendations to be considered 'by the workshop 
participants in the afternoon. A leader and a recorde'^ were 
designated for each group to organize the discussions and 
reporting. 

The final "session involved all participants in the review 
and discussion of the suggested recommendations. 

The planning . committee worked from a summarized transcript 
prepared by a professional conference recorder to produce a draft 
of the workshop proceedings. This was circulated to participants 
for comment and critique. From this draft of the workshop 
proceedings and the participants* comments and critiques, the^^ 
planning committee constructed the "Synthesis and Action 
Recommendations." The report is now being provided to the 



1 



-9- 



spWsors who .supported the workshop and ,to other interested 

institutions and associations for their consideration cr 
actions • 
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Definition - WHAT IS^ -SUBJECT ACCESS? 

While the term "subject access" was used extensively in the 
planning and in discussions with the prospective participants 
before the workshop, it was not - realistic to assume that 
everybody meant the same thing by it. Thus, each participant was 
invited to furnish a definition before the workshop convened by 
completing - the plirase "By subject access is meant..*" The 
planning committee, independently, constructed its own. 

These definitions reproduced below, without attribution, 
provide a snapshot of the frame of reference of the workshop 
participants as they assembled. 

The planning committtee definition wai.?: "By subject access 
is meant the. use of words, -phrases, or symbols to represent the 
intellectual content of recorded knowledge for purposes of 
organization and retrieval." 



As expressed by the participants, by "subject access" is 
meant. . • 

1. ...the ability to retrieve information or documents, etc., 
from their theme or unit concept. 

2. ...in the broadest sense, gaining access to a desired docunient 
by means of a subject heading. More specifically, as a producer 
of abstracting and indexing publications , it means locating a 
document of interest by- looking it up in the subject index of a 
bibliography or online service. 

I hope that the consideration of subject access will include 
a . comparative evaluation of current techniques of subject access, 
as wel 1 as a comparative, evaluation of different -.ypes of 
authority files. I would further hope, that the problem of 
subject access will be discussed in the context of current and 
developing technology , especially the computer, 



3. .••access to a document by means of its actual content as 
contrasteia to access by means - of — its tTtley author names,- and- 
routine identifying citations. Such subject content is usually 
referenced within the document title but often xn a very narrow 
sense. Subject access provides many additional routes to the 
document not available via the title alone. In today's 
environment of multidigcipline and mission-oriented interests, 
this increased subject access is most important. Much important 
information is buried within documents and is, not retrievable by 
way of routine citation searching. 

Subject access has another important objective. It places* 
the content of a given document in a relationship to other 
documents with the same or similar contents A given subiect 
serves as a central collGcting point for all documents of similax 
content. Since individual subjects gleaned from the document 
content are more specific than the titles, the resulting access 
points are more effective for searching. Subject access, ur.like 
.a formal title, is amenable to vocabulary controls Such control 
results in ^ more ordered, more efficient, and less jostiy 
sear.ching. 

4. ...access to topical information/material, by means of 
descriptors in an index or catalog. Access to topical 
information/material by means of numerical or letter/number 
notations (e.g., Dewey Decimal Classification). 

5. ...the .means by which any given collection of information may 
be searched by subject, topic, theme, k<:!y word(s), oi^ element 
within . a bibliographic entry ■ other than the "standard'* ones 
(which I take, to . mean, e.g., author * s name:, title of work, 
imprint). For the "Cumulative Shakespeare Bibliography" we are 
preparing under a grant from the NEH, our electronic data- 
processing system permits access to data contained in all the 
recognized elements^, of standard bibliographic entries organized 
within a comprehensive taxonomy, but also, by employing a 
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thesaurus-'term .'^seeding" we have the capability to search the 
file by subiect and Lhente. irrespective of the taxonomy- 



6. ...that a range of intellectual approaches must be used, in a 
disciplined manner that are keyed to different levels of 
arrangement of the material. Serial arrcingement and collective 
description (as distinct from item ' description) are the main 
characteristics. Content analysis of record series for their, 
subject matter is only one form of intellectual access to subject 
matter, and is the most expensive. One of the chief modes, and 
the least expensive, capjiuilizes on the user's prior association 
of proper names with activity/subject matter. A high degree of 
prpper name , control corrorspondingly provides a high degree of 
subject access, which is independent of content analysis. 
Entries for place namv«;s and names of political subdivisions also 
provide modes of subject access for those (a small minority) 
using this route. 

7. ..•a means of identifying and retrieving . a bibliographic 
entity by querying a data, file on fhe basis of discretely and 
systematically codified sub jects. 

8. ^ ...providing effective yet economical access to a 
heterogeneous mixture of information sources within a well- 
defined: or hierarchical research design. 

Subject-access systems unless of the highest priority (such 
as those^ necessary for Nati^6nal Defense and Medical Diagnosis) , 
should not provide instant answers to unstructured questions. 
Subjecb^^access systems should be designed to lead researchers in 
the direct>i^ they want to go through a series of increasingly . 
better defined^and rtdre specialized searches for sources to 
consult. Subject, access ultimately means the researcher working 
with a manageablev amount of. material, not providing the 
researcher with answersxto questions posed. 
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9. ...the avj/ility of a person to obtain facts, materials, 
information, etc., regardless of their age, geographic location, 
economic status, or social status. It means the access through 
-the .traditional means, i»e., a library, a^^^ thrbugii 
nontrac'itional means, -community service centers, computer data 
bases, and other people. Accessing this information., for people 
should be free of the prejudices of those assisting in the 
accessing, and free of cost to all. These materials and 
information may be available in a variety of formats also, 
including book, microform, recordings, video cassette, etc. The 
responsibility of those locating the information is to search the 
broadest possibl e"~ r ahg e o f ~To^^ and s o uf c e s . ThT¥"nd"eT"rni"trio n' 
o^E 'subject access may be very broad - not limited to our 
traditional use of the card catalog - as I feel that the locatio i 
of information should go beyond the four walls of library. 

10. ...the operations necessary for the retrieval of recorded 
data or information from the media on , which it is stored by the 
use of subject concept labels that have been assigned to the data 
or - informaton for purposes of identification or whiclr^ were 
inherent in the data or information. The term "subject Srccess" 
should also include, to the degree that it is necessary for the 
purposes of the information seeker, timeliness, low cost, 
relevance, comprehensiveness of recall, ease of us<5 of the 
system, precision, accuracy', unambiguousness, and provision of a 
useable replica of the data or information. 

11. ...the approaches to subject matter in a collection. The 
^approach may b^ systematic (a.s in the classified arrangement of 
books on ,a library shelf) or the approach may be topical (as in 
the subject headings in the card catalog) or the approach may be 
" natural or free " (as in the title words or words in an abstract 
or., subject description if they are for matching with query 
words ) . ' 

Ail of. the above approaches may be considered the result of 
subject cataloging an'd/or the features of the retrieval system (s ) 
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available to the searcher trying to find the subject matter in 
the collection. 

\Shera and Egah in 1956 said the objects of subject 

"ca't'alogi'ng should be: - -- - - - 

a. to provide access by subject to, all relevant material; 

b. to provide ' subject access to materials through all 
suitable principles of subject organization, e.g.. matter, 
process, applications, etc; 

c. to bring together references to materials which treat 
substantially the same subject regardless of disparities in 

-terminology, disparities which may have resulted from national 
differences, differences among groups of subject specialists, ^ 
and/or from the changing nature ,of th^ concepts within the 
discipline itself; . ^ 

d. to show affiliations among subject fields, affiliations 
which may depend upon similarities of matter studied, or of point 
of view, or upon ui;« or application of knowledge; 

e. to provide entry to any subject field at any level of 
analysis, from the most general to the most specific;, 

f. to provide entry through any vocabulary common to any 
considerable group of users, specialized or lay, 

g. to provide a formal description of the :subject content 
of any bibliographic unit in the most precise, or specific, terms 
possible, whether the description be in ' the form of a word or 
brief phrase or in the form of a class number or symbol; 

h. to,, provide means for the user to make selection from 
among all items in any particular category, according to any 
chosen set of criteria such as most thorough, most recent, most 
elementary, etc. 

12. ...being able to find out about pink elephants, magenta 
pachyderms or light red mammoths. It is being able to find 
information, or ref erences_to information, by topic, at ' varying 
levels of specificity no matter in what form, i.e., book. 
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article/ conference paper , patent , report, tables, drawings, 
photographs, bibliographies, or in parts of any of the above. 

13. . . .a way of coding the_. content of documents in a collection, 
by creating a subject vocabulary (alphabetic, numeric, or other), 
which can then -be used tc identify documents relevant to a 
particular search need. 

14-.' , . . THE Ability of user s, (such as Researchers, library pa- 
trons, students, librarians , ^ government of- 
ficials and information seekers in general) 

OF A DATA BASE (such as a card catalog, an online 

. information file, a periodicals index, a 

publisher's catalog, etc.) 

^" TO LOCATE REFERENCES (such as cataloging information, 

bibliographic citations, , legislative 

^ information, etc. ) 

TO NEEDED INFORMATION BY MEANS OF SUBJECT CONTENT 
DESIGNATORS as indexing terms,* subject. headings, 
descriptors, keywords, etc.). 

15. .an . approach to organization of and.' subsequent retrieval 
from a body of public knowledge represented- by documents, on the 
basis of their contents and potential for conveying information. 
Since, by public knowledge is meant a rational consensus of ideas 
and^ information, sutTject access is predicated on the existence of 
a consensus and limited to that consensus., ; 

16. ...a method of' retrieving information from a document 
collection such as a card catalog or machine-readable data base, 
by addressing the topics which are discussed in the documents or 
members of the file and which/ are usually represented as natural, 
language terms or added index te^ms. 
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17. ...the search strategy or tactics involved in retrieving the 
manually, semiautomatically.gbr automatically indexed content of 
documents . - ■ 

18. ...when considered in practical terms, a connection v;ith the 
online retrieval systems that will prevail in the future. In 
these circumstances,, subject access must then be .provided with 
respect to a wide variety of different criteria, involving large 
sets of free-form vocabularies. The questions connected with 
subject access include the handling of information items 
described .. by diverse, nonstandardized information identifiers, 
the -harjdling of incompletely or ambiguously identified 
information items, the identification of incompletely specified 
information requests, the matching of information requests with 
information items. 

■ 19. ...identification of relevant records based on subject 
searching, particularly for concepts » that are not eas^y^ 
described (e .g .," street people," "white flight," "honesty'r in 
government"). I want to be able to find publications on hov/ to 
install various kinds of bricjs,-' walkways in my back yard. 

' 20. ...the means by which people who seek information use words - 
- a word or word group - to find what they seek. These words, 
taken from * language , may be controlled or uncontrolled/ in varying 
degrees of organization according to a, wide variety of schemata, 
schedules, and classifications. 
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SUMMARY OF WORKSHOP PROCEEDINGS 



The Subject Access Workshop was designed to approach the 
topic of subject access frorr] a very broad perspective, addressing 
the range of subject-access systems and, most important, 
including among the workshop participants ^ representatives from 
many pf the groups involved in subject access. Subject acc^iss 
was viewed by the workshop attendees as a series .of filters 
providing - access/ to information at different levels. 
Participants in the sub ject-access system, all involved with this 
stepwise processing of infortriation, include publishers, 
abstracting and indexing services, information , dissemination 
centers, libraries, and, ultimately, the end-users^ All werie 
represented at the workshop-;' as were a range of disciplines 
including the sciences ai;d the humanities Educators, 
researchers, and funders in the sub jectr-access area also 
particpated. \ \ « 

As indicated in the workshop objectives, the emphasis was on 
identification of the contemporary subject-access issues and 

.related recommendations for action. At the workshop, several 
types of activities served- as background to • this major task, 
including sessions on the current situation and ongoing research 

.and development areas. Addresses on "Subject Access in 

Perspective" (F. Wilfrid Lancaster) and "Challenges for the 
Future" (Margaret T. Fischer) are included as appendices to this 

.report • Each activity contributed ^to a more thorough 

understandirjg of the subject-access problem by the diverse group 
of participants in attendance. 

The definitions of sQbject access provided by the workshop 
attendees prior to the meeting (and reproduced in .the previous 
section of this report) gave ah indication of the diversity of 
the viewpoints that would echo . throughout ^ the- workshop. 
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Definitions varied for the types of primary materials arid 
subject-access systems referred - to, in the techniques mentioned 
as appropriate, in the purposes ■ of subject access, and in the 
degree of emphasis placed on the materials' accessed versus the 
user. 

a. 

Professor Lancaster's presentation opened the workshop and 
addressed both the present and the future of subject access. He 
mentioned many of the topics that would emerge from the workshop 
^^.J^^y issues. Major developments since World War II were noted: 
the development of post-coordinate- indexing techniques, the 
implerjnentation ' of such methods through increasingly more 
sophisticated means, and the growth in machine-readable data-^base 
activity and in associated telecommunications activity. At this 
point, the workshop acknowledged that subject-access systems are 
complex, the success or failure of which are governed by a number 
of factors -mostly outside the control of the information center 
providing searches. The complexity of subject-access systems 
also suggests the need for multiple approaches . to system design 
and the difficulties in evaluating performance. Current gaps in 
our information . primarily involve the user-system interface, 
including user search behavior* 

^Speculating on the future. Professor Lancaster noted that 
subject-access systems will be operating in . a changing 
environment in which information is acquired on demand in a pay- 
as-you-go process. Use will switch from a delegated or mediated 
tnode to direct user-system- interaction. Search systems will 
utilize both the conventional and newer approaches, including 
some which are language independent. An unlimited variety of 
d. a bas^ will be available to the user, and the system will 
assist in the selection of both data bases and references within 
a data base. The' language of the systems is likely to be hybrid 
— natural language input and a post-controlled vocabulary at 
output. 'This picture of the future stimulated workshop 
participants to consider trends of present developments and to 
evaluate them in terms of the overall goals of subject-access 
systems . 
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• ,,.The session -cm "Subject Access: Where are we now, and why 
haven't we gotten) further?" included presentations by four 
workshop participants representing libraries, abstracting and 
. indexing., services, information dissemination centers, and 
publishers. -Each discussed the current state of affairs relative 
to subject access in his segment of the information community, 
goals and objectives, and some of the major problems encountered. 
The individual presentations also brought out some of the 
interactions among the different ' segments of the community 
represented. 

Mary Kay Pietris, from the Library of Congress, addressed 
^the topic of development and use of LC subject headings. They 
are, she pointed out, both widely used and widely criticized 
criticisms relate mainly to their use of outdated and/or 
inappropriate terminology, to various inconsistencies, and to 
time delays in productibn. Barriers to resolving these problems 
include the lack of a code, or theoretical basis, for the 
practices followed and the difficulty of modifying large files of 
back records both at LC and in other libraries to make them 
consistent with newly adopted practices. 

Russell J. Rowlett, Jr., from the Chemical Abstracts 
service, described the goal of subject indexing by the 
abstracting a'nd indexing services as the provision of access to 
the contents of documents, one step in a series of filtrations of 
information available to the potential user. Subject-access 
issues in the abstracting and indexing community include 
•authority control, cooperative terminology control, and 
controlled versus uncontrolled search vocabularies. 

Kay Durkin of Bibliographic Retrieval Services pointed out 
that considerable work has been done- on a file by the time it 
reaches the information dissemination center and that this 
.constrains further activities. Other constraints are imposed by 
the search system used. Information dissemination centers are 
mainly responsible for formatting bibliographic tapes for 
searching and for developing appropriate search systems. The 
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diversity of file formats is a proble. in loading the files and 
is also a problen, in the education of users by the information 
dissemination centers. 

Luoille Gordon of McGraw-Hill diccussed the role of the 
commercial publisher in providing subject access. S^e pointed 
cut that the limited subject-access activit.es — ^ - 
primarily bac. of the booH indexing and assignment of broad 
subject categories - are seen as sufficient by n,ost publishers 

m-;r.v,+- be orovided by the marketplace, but 
An incentive to change mxght be provx :r . 

libraries are not a highly visible part of that marketplace, and 
there must be some business advantage to warrant doing the extra 



work 



several com„,onalities were identified by workshop 
participants in the presentations given. In viewing the overall 
flow of materials from author to user, with all the attendant 
subject-access activities, a trend toward centrali^tion was seen 

^ \ r^ai-i-i rularlv in the form of a 
by some (but not by others), partxculaiiy xn 

. . X.- ^^i-^ base. The need for awareness of 

national bibliographxc data base. m 

. . .u =r,ri a willinqness to cooperate wxth 
other system partxcxpants , and a wilixngn p 

them to improve subject access, was also a common thread of the 
presentations. ^e existence of, and the need to buxld upon a 
diversity of subject-access systems was recognxzed and the key 
issue Of linkages between systems was raised by several 
participants. 

^ ■, r.^ Q*:.ssion on "Current Research and 

A second background sessxon ^ 

Development in Subject Access" included a sampling of a number of 
current research efforts in the subject-access area. Topics 
covered included the Subject-Access Project recently completed by 
Syracuse University (Pauline Atherton) . the Preserved Context 
iLex system , (PRECIS) (OavM Batty). and sub^ect-acces 
activities at CCLC. Inc., (formerly Ohio College Library Center) 
(Ann EKstrom). Shorter presentations were also given on subject- 
access activities at the chemical Abstracts Service (RusseX 

w ,r.<-s.as in the humanities (Richard Lineback) , 

Rowlett) , subject access in 

. f fhe conaressi^nal- Research Service thesaurus 
development of the Congress — 
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( Frederick Rosenthal), and- automatic indexing (Everet*^^ enner) ^ 
Generally these presentations gave an indication o ^.,v,-iec 
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access techniques currently being tested or in use. ^J^i"' 
picture was of considerable diversity, ranging, for eJi^'^PIe, f^°^ 
the highly structured PRECIS to several examples "natura^ 
language" approaches. A user orientation was strong ^nost 
the .presentations, and the importance of system interf;.^es an^ 
the need for building upon existing systems were also ^"^Phasiz^ 

The transition for the workshop group from conc^"tt^^ion 
the current status of subject access to a f ut^^^^-orient 
discussion of issues and reco^umendations was provided ^ Margate^ 
Fischer in her address "Challenges for the Future." ^ ^ '''^"'^'''^^3 " 
the participants that the goal of subject access i^ ^ ^ 
information on all levels, in all depths, to ^3- kinds 
people. She' emphasized the different types of prima^^y m^terxa^^ 
and different types of user needs and the opportuniti^^ ^Vail^^^^ 
to develop responsive systems. A series o.f challe^^^^s „aS ^ 
out: to 'vork together to provide subject access, ^ ^ ^over 
forms and types^of information, to respond to the mafl^^tpi^ce, 
use technology in support of subject-access system^' to b^^^^ 
associative systems, and others. Finally, she ^^^^J^^sed 
basic challenge as that of maintaining knowledge ^ ^lal?^^ 

unity through sifting, reviewing and synthesizing, mO^^n^ tows^ 
H. G. Wells' S i concept of a dynamic world encyclopedia* 

The next i workshop session dealt directly with s'lA^^ect-access 
issues, with participants invited to put forth key F^^^iem ar^as 
and concerns which they had indentified. In this w^^J^ ^ .^^^^^g^ 
twenty-five issues was created and discussed. M<?^^ of tb^^ 
related to' ithe major themes evident in earli^^ works oP 
discussions. 

"Diversity" was the conference key word, emer^^^g over 3""^ 
over again as a" characteristic associated with eve^Y Aspect ^ 
subject access. Diversity, was seen in the materia^-? Ptocesse , 
in organizations doing the processing,^ in technique^ Jo^ ^u^^^^^ 
access, and, perhaps most important, in users of s<^ 3ect-acce s 
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systems. Generally, diversity was seen as positive, as the only 
means of dealing with the given variations in materials and 
users. Centralization and standardization were discussed, and 
were found in ' most cases to be less desirable than the 
maintenance of diverse systems with bridges or interfaces between 
them. In the area of techniques, post- rather than pre- 
coordinate systems were seen as preferable, with hybrid systems 'a 
possibility. Several bases for new techniques were meiitioned. 
Throughout the discussions, the need to focus on the users ^ — to 
understand their needs and to design systems in response to them 
— was emphasized. 

The issues identified by workshop participants fell 
generally into three categories: tho^e associated with the 
subject-access systems themselves, particularly coverage, 
vocabularies, and search techniques; those associated with the 
system environment in some way; . and those involving the 
interfaces between different subject-access . systems and between 
users and systems. They are listed and briefly discussed below. 

Issues Relating to Subject-Access Systems 

. Diversity of subject-access vocabularies 

. Little subject access for some types of publications 
and for some subject fields 

. Outdated subject-access systems 

. The extra effort required for expanded recording of 
subject-access entries 

. The need to supplement subject access with quality 
indicators 

. The need for techniques for purging collections 
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. Pre- versus post-organization of data collections 
. Time dependencies of subject-access vocabularies 

These issues covered a wide scope, addressing both general 
concerns and specific system aspects related to vocabularies 
used, search techniques, -and other system characteristics. 

Two general problem areas were mentioned: the lack of 
subject-access systems for some types of publications and for 
some subject fields, and the outdated nature of some subject- 
access systems. Differences in coverage for books and journal 
articles, the inadequacy of subject-access systems in many areas 
of the ' humanities, and the limited initial efforts in subject- 
access systems associated with audiovisual materials and 
nonbibliographic data bases are examples of the first problem 
area. The need for appropriate systems for all categories of 
users, including the general public as well as researchers and 
educators, was also mentioned. The point was strongly made by a 
participant that the information needs of poor people, 
minorities,' women, disabled persons, seniors, and prisoners are 
not adequately met by existing means and that subject-access 
approaches suitable for these audiences were different than those 
usually considered -by ^ most library and information service 
institutions. In connection with the question of outdated 
systems, the workshop participants recognized the related problem 
of the difficulty of modifying or replacing an already existing 
system, but saw a need for up-to-date systems which have maximum 
responsiveness to user needs. 

The diversity of system vocabularies may be either a problem 
or a strength, but cerv^.ainly it is a fact of life to be dealt 
with. Here, as elsewhere, the need for bridging techniques 
between vocabularies was noted. Historical developments 

alternating between natural language searching and controlled 
vocabularies were suggested. Another vocabulary problem that was 
identified concerned the time . element; basically, whether the 
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vocabulary used should be associated with the ti„e of generation 
of the primary material or the time of its use/ Snd how are 
linkages between the two frameworks established? This problem 
has one solution in manual files and possibly another in ■ the 
newer automated files. 

Several issues revolve around the development of user- 
responsive systems, and three specific techniques were discussed. " 
one xssue mentioned throughout the workshop was pre- versus post-^ 
coordinate systems, with the advantages and disadvantages of each 
xn .different. circumstances addressed. a suggestion for 

supplementing current subject-access systems with quality or 
nonsubject indicators was made in order to expand the access 
paths available to us.rs. Appropriate nonsubject indicators 
Should also be included and the ability to search on them 
provided. ^Another .element of subject-access systems which could 
provide better services to users " would involve maintenance of 
records of system use. ' This conc^ept , involves both storing of 
successful search requests for others" use and the use of 
performance data, in system modificati6n and development. 

Also, techniques for purging or partitioning files were 
discussed.. 'H.ough this issue relates ta user considerations, it 
xs. more concerned with the technologies utilized by systems and, 
associated cost considerations. 



Issues Relating to Subject-Access Envi 



ronment 



Lack of R&D support for subject-access methods and 
systems . 



Environment needed for initiating change 

impact of technology on subject-access policies and 
practices, and user characteristics 

Cost effectiveness of subject-access processes 
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Institutional responsibilities and relationships 
Diversity of _ topic coverage . 

Special characteristics of the humanities literature 



r 



Need for a relationship of subject-access to numeric 
data collections 



A number of distinct issues can be identified when one 
considers the environment in which subject-access systems 
function. This environment includes materials on thei input side 
and users on the output side, and involves institutional 
participants including publishers, abstracting and indexing 
services, information dissemination centers; and libraries. The 
role of researchers and funders must also be taken into account. 

Some of the questions th^t arise in the evaluation of any 
large .system relate to participant interactions, system costs, 
and system performance. In the subject-access field, the issue 
of institutional responsibilities and relationships is a 
particularly complex . one, ' since the subject-access chain is long 
and involves participants with widely differing goals ^ and 
incentives. Analysis of all the subject-access systems for books 
and monographs suggests a number of structural shifts which have 
occurred and others which appear likely in the future. Awareness- 
of the likely effects of these structural changes on the subject- 
access system is critical. 

Considerable work has been done, mostly several years ago, 
on .the evaluation bf subject-access systems from the standpoint 
of costs and/or effectiveness. New techniques and new systems 
will require new evaluations, utilizing different measures of 
effectiveness than those used for earlier systems. . This issue is 
not strictly a research issue; it also includes cost- 
effectiveness as a criterion for evaluating subject-access 
systems. 
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Throughout the workshop it was noted' that rapid changes are 
occurring. Many of these changes are precipitated by an outside 
element, that is, new technological developments. As in other 
areas', the potential impaot of technology is considerable and 
J^A^J^^ -^9- -^^ ^he s ubject iof careful study . Of particular interest 
are new subject-index systems which can be designed to • take 
advantage of hew technologies and the related effect of such 
systems on policies, user-9> etc. The impact of technology to 
date on existing subject-access systems should also be 
considered. 

In some areas, changes may not be occurring as rapidly as is 
desirable due to inertia and/or the difficulty of making changes 
in compl.ex systems. Yet, as already noted, new developments with 
the potential for improved user access are available, and will 
continue to "become so. One need expressed by the workshop 
p-'^.rticipants in this regard is the creation of an environment for 
initiating change. 

Many of the issues cited by workshop participants call 
attention to specific areas of research needs, a number of which 
deal, with overall system considerations on a national scale. 
Others mentioned the need for research and development to exploit 
new techniques and technologies. In contrast with these 

expressed needs, a lack of R&D support for subject-access methods 
and systems was noted as an issuQ. 

A final group of issues related to the subject-access 
environment concerns the material available for coverage and its 
characteristics. The range of materials requiring subject access 
is considerable, and characteristics vary considerably. In 
addition, the degree of coverage of different groupsr of materials 
varies substantially. Singled' out in this area by the workshop 
attendees were the humanities literature with its distinctive 
characteristics and numeric data collections which lack subject- 
access systems. 
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Issues Relating to Interfaces 

" ') ■ 

System 7- system; 

, ■ * 

• Need for bridging techniques f or sub ject-access schemes" 

• Need for descriptive register of data bases 

• Relationships between library catalogs and online 
access services 

. Need for comparative studies of subject-access 
alternatives 

User - system ; 

• Diversity of user types, diversity of questions 

. Aspects of questions; 

models of query formulation 
query expression 
query characterization 
query profile transformation 
vocabulary browsing 
nonsu^)\^ct indicators 

• Lack of application of user /use for system improvement 
. Immediacy of access 

. Roles of intermediaries 

Two major kinds of subject-access interfaces were addressed 
by workshop participants: the complex interrelationships between 
different cub ject-access systems and the relationship of the 
systems to their users. Issues were raised in both areas. 



--Shop participants noted 

avaiLble ; ^^T^" °" the quantity and variety of 

available subject-access* systems ru,r. ■ . 

° xssues dealt with thia 

comparison of different svst^m« • . aeait. wxtn the 

between Hh. ' ^^.^^ems - specxfically, the relationship 

- LcLs alfe .-^^ access services, and subject! 

Z fori rr ^'^^-^^ ^""^^'^^ °^ user-.uestions. 

ine tormer xssue raises several noint-o ^ . 
that of m^nM 1 ' ""^^ of comparison,- including 

tnat of manual versus online systems :,r,H k 

systems and book versus journal 
^^txcle coveraafi i«rv.-;i^ i.t- jy^^i-naj. 

ac=cess sv.tel , Performance of individual subject- 

studv Of a-rr '"^"^""^ evaluated, a need for a comparative 
study of different systems was expressed. 

Viewing the furur*:* ;pe 
• access systems „i be irectl""""""* "^"^ 

online mode raises issues of . ^" 
avaiia-oie data bases 1 tlrr^re^b ^"""^ 

account the^widelv varvin , searches will take into 

.a.e up today.snu;,™ s?rr: ^ ""^ 
Of dat-:, K y^T^em. To help xn the selection 

Of data bases, a descriptive register of ^ v 

online, was indicated as desirable n ^"^"^^^^^ 
area was as desxrable. Unxform.ity in the language- 

area was generally rejected in favor nf k - 

would allow a kind of translationT techniques which 

or translation among subject-access schemes. 

the 'rlaT T "^^^"^^^^^^ interfaces, thought was given to 
the range of users, including both the end user and' 

xntermediaries. Classes of users were . • - 

_ , ^. ut,ers were mentxonea — the aeneral 

populatxor^ scientists, researchers in t-^, v. : • S^n^ral 
business peopie, special interest o L 

even greater diversity of questions Is a^LT o^'uf Z ^ 

natureT; ; " were raised concerning tb 

forlr I ^"-='-ns. includin, the need for models" of ^uery 

formulation, query exnrf:*QQi r^r, 4"«^^y 
' M*-*^'*-^ ^Apression. Querv nh^ r^*- ^^^^ a. • ' • 

to-orofil^ ^ . '^"'"''^ ^"^-acterxzation, and quej-y- 

to profile . t^ransformation. Related ssues w-1 i i 
r^^^i . . —issues will require tVi*:* 

exploration of search techniques such v, , 

the Mso -iniques such as vocabulary browsing and 

the use of nonsubject indicators t-^ ^- 

■ . -^waxL-dtors . I:nmediacy of access wi t-v. 

online systems was noted, and the effec- ofth 1 

deemed to be an issue Fin.ii . °" ^"^^ "^^^ 

an issue. Finally, developrrent work leading to full 
use Of user feedback in system design was called for. 
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I After the identification of subject-access issues, the 
workshop participants split into small groups to develop 
suggestions for recommendations . Each group brought back a set 
of draft recommendations to the full workshop; and these were 
briefly discussed. Many recommendations, some overlapping, were 
presented and it was left to the Planning Committee to build upon 
the sense of . the meeting to develop the final summary 
recommendations. * 
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DRAFT RECOMMENDATIONS OF DISCUSSION GROUPS 

• V . ^ ■ ■ 

Each the five groups produced a set of recommendations. 

These were brought to the total workshop for discussion. Below 
are the statements ^ of f35<^m all groups • 

Group A , , . • ■ 

1. investigate the desi-gn characteristics of a dynamic file 
syst>^ based on file-usage data from the user population and from 

^collection change ^and growth. The system itself should be time 
independent, ada;pr^ng to changing conditions through accumulation 
and organization of information to guide its . functioning. 
Evaluation will include estimates of performance and cost of the 
file systems. 

2. Examine the comprehensive and macro-level relationships of 
the various sub ject-access.'; approaches to the information sought. 
A pilot project should investigate the response and performance 
of various syj^tems by taking a few selected questions and 
following through, the card catalog subject headings, a vertical 
^Lle, abstracting and indexing services to ' journal articles, 
back-of-the-book indexes, etc. The search trail would be traced; 
subject/entry points would be compared; the amount of matierial 
scanned would be recorded; and the results of each s.earch would 
be corf^pared. Two questions to be answered are "What information 
dc>es the user, take to the shelf in a library to identify material 
sought?" and "How many items have to be scanned before the 
information sought is ^bund?" ^ 

3. Analyze jthe question-asking process from concept formulatiom" 
.to the interrogation. of the system, including the formulation and 

expression of questions and the reformulation of selected 
questions in light of the system's response. 



4.' Construct a subject-access system incor'.porating comprehensive 
nonsubject indicators (e.g., journal title,, /publisher, author, 
etc.) for moliographs and other literature similar to the combined 
searching approaches available through automated abstracting and 
indexing services. Kesearch has indicated that searchers rely on 
such data as publisher- or journal title as quality indicators. 

Group B 

1/;.^ Devote effort to exploring the technical and economic-j 
feasibility and utility of other approaches to multifile subject 
access rather than support continuing development of broad-based 
universal subject description schemes and automatic subject- 
switching mechanisms. Among other approaches include: . 

a. the building of composite indexing records from 
multiple sources, 

b. developing publicly available retrieval' aids (e.g., 
stored profile parameters) . 

2. Encourage .increased cooperative efforts between and among A&I 
services and» the national libraries to harmonize subject-access 
vocabulary in areas of overlapping interest. 

3. Investigate the complementarity of subject access to 
monographs as currently applied by the national libraries and the 
A&I services, via an applied research project. 

4. Make increased use of user feedback' and data available . from 
retrieval services to improve the design of data bases, through 
applied research projects. 
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Group C 

1. Mobilize funding support to assist the Library of Congress 
to: 

^* oreate an online system to manage and distribute 
BUb ject authorities , 

Undertake necessary vocabulary reform in th- list 
of LC Subject Headings and alter practices in 
f\ssigning subject headings to works cataloged, 
^* eliminate subject cataloging backlogs so that 
•"Attention can be turned to making subject changes 
without the pressure of undone work hampering 
V^rogress in this area. 

2. Fund thrt creation of a descriptive register of data bases, 
with a stateuvi^i^t of the scope of each. y 

3. Seek fu\\.,Jiing so that the Library of Congress can provide 
increased suU^^^^ access to audiovisual materials by: 

^ • Adding cataloged audiovisual entries to the LC MARC 
vxata base, 

extending the Ca taloging-in~Publication Program to 
audiovisual materials , 

<«^xpanding the number of subject (and other) access 
v*ioints to audiovisual materials. 

V/ ' 

4. Fund comprehensive study of various levels of use 
including: 

^* the behavior of users of_,card catalogs and other 
methods of subject access at different types of 
institutions (public libraries, research libraries, 
etc.); 
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b . barriers to subject access experienced by various 
categories of users, e.g., researchers, users of 
archives, public libraries, etc.,, and categories of 
users such as the disadvantaged, imprisoned, etc. 

5. Investigate the p^^vision of multiple levels of access to the 
same materials, providing funding for such a study and then 
funding for implementation if this is determined to be feasible . 
For instance: 

a . investigate the feasibility of providing of 
nonresearch subject headings to public libraries; 

b. provide cross linkages between vocabularies, 
classification schemes, ^^^^ — other taxonomical 
access tools, such as linking^ terms in Sears, LCSH, 
and ERIC. 

c. support and accommodate special refinements of 
subject-access tools such as special 
classifications. 

Two additional suggestions that were discussed in the group but 
not included in the recommendations above were: 

1« Produce an illustrated manual on methods of ' making subject 
heading changes in card catalogs. 

2. Hold a conference/workshop to study the possibility of 
creating a subject heading code (rules) . 

Policy statements agreed on by the group: 

1. Standardization of vocabularies, classification schemes and 
access methods is a non-goal. 

2. All types of publicly accessible information services must be 
included in considerations of subject access: public libraries, 
academic libraries, research libraries, archives , special 
libraries, etc. 
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Group D ••. 

1. With the desire to encourage the diversity that will best 
serve the end user and in response to the. problems perceived in 
subject access, it was recommended that subject access should be 
the responsibility of each discipline or type of library. New 
organizational structures as needed should be developed through 
already existing A6=l services or through other professional 
specialist groups. Zero-base thinking should be • used to spur 
innovation. In other words, start from scratch so that advanced 
techniques such as post-controlled vocabulary and automatic 
indexing . can be rapidly utilized. This approach can be of 
particular value to the humanities. 

2. In order to encourage an environment for change it is 
suggested that the library community act as user intermediary to 
the agencies that produce information, e.g., the A&I services, 
index publishers, and other publishers by focusing on end user 
needs and informing those agencies of those needs. The expense 
involved in change may then be justified. With only an 
occasional complaint or suggestion from the marketplace the 
producers cannot evaluate change to benefit the user. 

Group E 

1. Perform research on the severdl general problems with subject 
access in the humanities that may be different from those in the 
sciences and engineering. One problem is that not much 
information is available on how humanists do subject searching. 
There is some question as to whether some notions about the 
potential value of A&I services are based on fact or on 
speculation and hearsay. 

2. Retain diversity in subject-access mechanisms. We are unable 
to anticipate all the needs of individuals seeking subject- 
related information in all disciplines. 
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3. Encourage libraries that require subject access in ^^e^ter 
depth than that provided by LC to band together and po^ their 



resources . 



4. Make direct access by users, insofar as possible, the ^o^^ 
subject-access systems. This should not be restricted inline 
systems but should also include manual systems such ^^^^ 
catalogs; that is, subject-access systems should be f^^^^e aS 
simple and user interactive as possible. 

5. Establish basic "standards for the bibli^^t^phic 
identification of documents. 

The recommendations that emanated from the groups were 3^^^^ssed 
by the full meeting. A summary of the recommendations is 
provided below for completeness: 

1. Support and endorse a diversity of subject-access ap^^^^Qhes- 

2. Make simplicity in the use of a subject-access ^^stem a 
general goal, but the intended audience should be a ^^^^e for 
inclusion of specific techniques that will be most effective. 

3. Make access available both through controlled voc^^^l^rieS 
and natural language, where possible. 

4. Establish a constantly updated directory of avails data 
bases including access, scope, and cost. 

5. Encourage library schools to include courses on s^^jgct 
access beyond the technical facility of subject headi-^^s, and 
extend such programs as part of continuing education. 
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.6. Perform research to determine the basis on which records might 
be transferred from online status to some type of record .... ,e 
considering such factors as age, frequency of use, ard s.^!!^: 
field for the purpose of lo« priority retrieval. 

7 Encourage «EH and other appropriate agencies to support 
research aimed at identifying the differing subject-access nLs 
Of the humanities as compared to other disciplines with the aim 
Of promoting the development of new subject-accoss tools in areas 
Where there are now none and improving those ' that are 
unsatisfactory. 



-37- 



SYNTHESIS AND ACTION RECOMMENDATIONS 

Introduction 

Building on the experiences of planning the workshop, the 
discussions that took place, and the recommendations suggesl.ed, 
th^e planning committee has constructed a limited- number of action 
recommendations as presented^ below. To the extent that the 
participants contributed directly to these results through the 
workshop and follow-up correspondence, full credit for their 
insight and judgment is intended. However,- to the extent that 
the planning committee has selected and combined ideas, placed 
emphasis, and otherwise shaped the action recommendations below, 
it has done so on its own responsibility and accepts any 
criticism that may accrue. It does not wish to imply a complete 
consensus with this synthesis by all the participants in the 
workshop, but the planning committee does hope that it has 
correctly understood the results of the workshop and expressed 
views and actions that most of the participants will support by 
their own actions. 

In addition, recommendations recorded at the workshop but 
not subsumed in the planning committee synthesis contain other 
actionable ideas (see preceding section of the report on draft 
recommendations) . The planning committee invites thoge who have a 
specific interest in aspects that were not selected for emphasis 
to take the initiative in developing them. 

Major Th femes 

A few themes relating to subject-access techniques and 
activities stood out strongly during the workshop. The most 
overwhelming of these was "diversity," even though one of the 
intentions of the workshop was to seek "useful commonality" of 
problems, approaches, and resource sharing. Moreover, there are' 
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ser.ious communication and understanding gaps betv/een the ' 
practitioners in the various" communities of interest. 

It is clear that diversity in subject-access needs and 
approaches 

.is real , 
. is gopd, 
. will continue. 

Diversity comes from the many environments of use of information 
and the varying structure and culture ' of sub ject areas or 
disciplines. wo rk to improve subject -access must take int o 
account t he fact that there is no one way that can be appl ied to 
all areas. Howeve r, there are usef ul linkages among approaches 
and"""^ctlvitles that a ppear desirabl e but _which d o "Ho t yet7~ exis t . 
Helping to high: ight these linkages was one intention and theme 
of the workshop. 

To illustrate sources of diversity: groups of institutions 
serving the general public have needs for subject-access 
mechanisms much dif f erent , f rom those of major research libraries. 
Those needs are determined by the purposes to be served; the age, 
educational level, and culture of the patrons; the location of 
the population being served and the logistics of serving it; and 
other special requirements. 

Another strong theme was that subject-access mechanisms 
should be designed for end users and not exclusively for 
information professionals -- whether the subject-access tool is a 
. card catalog or an interactive computer search system. This is 
not to say that professional intermediaries should not or will 
not be employed in many endeavors. Users should not and need^not 
be intimidated . by the language and complexities of formal 
information tools nor be inhibited by the limited population of 
information professionals. 

Consistent v;ith the emphasis on adaptation to diverse needs 

and environments , "postcaardiiiation:! 5^___emEhasized oyer 

"precoordination" in the design of information-access tools. 
Tho'sT'^ho organize information for retrieval can not anticipate 
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ail future interests in and uses of the inrformation. Thus, at 
least some component of the overall storage and retrieval system 
(and. some would say, the major component) must be a way to 
determine at the time of retrieval relevancies not reflected in 
the original organization of the collection being searched. The 
possibilities for achieving this are improving with modern 
computer-based systems . 

Groups having common interests must band together for mutual 
benefit, and resource sharing or other unified actions must go 
well beyond the boundaries of libraries to include publishers, 
abstracting and indexing organizations, and other information 
.agencies. 

An area that regards itself as deficient in organized 
subject-access tools, such as the humanities, can learn from 
other areas, such as science and technology, that have put more 
effort into development of such tools. However, the techniques 
developed for the sciences should not be accepted blindly; all 
may not be relevant to the unique information-seeking needs and 
conventions of the humanities . 

Yet another theme was that unive rsal standa rdization should 
not be an end in itself. Carefully selected -standardization can 
be a useful, even necessary, tool in the reduction of redundant 
effojtrt; and local standardization (by subject area or - use 
environment) is a tool in effective system design. However> 
interlinkage among the various communities -of interest (e.g. 
among the various vocabularies of subject cataloging and 
indexing) is far more practical and acceptable, and therefore 
possibly achievable, than any attempt at absolute uniformity. 

Finally, a single grand plan with centralized funding cannot 
be expected. Whether or not a logical cas.e can be made for a 
single national plan, it simply will not happen in the 
pluralistic society and economy of the United States. • However, 
techniques and technology developed in or applied to one area may 
be of benefit to other areas, and furthering the transfer of 
useful technology is in' the national interest. ' 
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Ac.tion Recommendations 

^The following action recommendations each require an agent 
for an organizing initiative, a funding source (possibly 
separa^te) , and a set of people and organizations to do the work. 
The planning committee does not assume the authority to designate 
such participants by listing them in this report. Real progress 
can be accomplished only if those who have the capability to 
perform the work, those who can benefit, and those who can 
facilitate progress through providing study and development funds 
will step forward and take initiative and responsibility. By 
stating desired and specific actions in the paragraphs that 
follow, the planning committee hopes to provide a catalyst for 
that £0 happen.^ , 

1. . Interlinkage of library and AficI produced subject-access 
tools" 

Problem ; How to bridge between library catalogs and 

subject-access tools and approaches produced by 
abstracting and indexing organizations (or 
other specialized information access 

organizations) to include: 

• both printed and computerized mechanisms, 

• both present techniques and evolution toward 
future possibilities, - 

. educational efforts in library and 
information / science schools and in-service 
training, 

. work tools needed to apply the techniques, 
. integration of mechanisms both by the , creator 
of subject-access tools and at the .point of 
use of disparate tools. 
Actions ; a. Locate action agents in each of the library and 
A&I communities who will collaborate to develop 
demonstration projects in a specific limited 
Subject area in both of the categories of 
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"hard'V science (e.g. physics, chemistry, 
biology, mathematics, engineering, etc.) and 
"soft" science (e.g. psychology, sociology, 
anthropology, otc.).. ^ The results of the 
demonstration projects should be an 
understanding of the specific subject-access 
problems of the selected area, a solution of 
them in an integrated environment, and pilot 
experience to guide subsequent development in 
other subject areas. The specific focus is on 
how to. bring library and A&I tools together,, 
not on the development or exploitation of 
either alone. 

b. In a similar manner, plan and carry out a 
demonstration project on the linkage between 
formal catalogs and access tools to manuscripts 
and other forms of information in a specific 
area o'f interest to the humanities. 

c. At the beginning of each project, require the 
assembly and dissemination of a state-of-the- 
art review ,as it relates to the subject access 
in the chosen limited subject area, as well as 
require full documentation and dissemination of 
the final results of the project. 

d . Additional integrative efforts are needed to 
broaden awareness and understanding of the 
various kinds of subject-access techniques and 
tools. At present, publication and 
professional discussion of these subjects tend 
to be parochial and limited. Toward this end: 

. establish joint sessions and/or special 
conferences sponsored by professional 
associations (e.g. ALA, ASIS, SLA, American 
Society of Indexers, etc.) on current 
techniques and developments in subject 
access. Structure such conferences in such a 
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way as to break down communication barriers 
among practitioners in. the various 
professions (e^g. subject catalogers, 
indexers, reference librarians, information 
specialists, information system researchers, 
etc.) . • 

. Prepare publications, including up-to-date 
^ textbooks and curriculum materials, treating 

subject access' in an integrative sense for 
formal education- and training purposes* / 

Toward a System view of future subject-access mechanisms 
Problem ; How to design a large subject-access system 

that adapts to usage patterns and collection 
change and growth without requiring disruptive 
\ alterations in system structure or costly file 

\ conversions. Motivations for solving, this 

^ problem come from the factors that are causing 

V the closing of large card catalogs, .the 
difficulties of weeding out inactive material 
from large collections, and the changes in 
vocabulary and concepts of interest as time 
passes. 

Action ; Conduct a research project on the design 

characteristics of a large dynamic file system 
/ based on file usage data from the user 

population and from collection change and 
growth. Collections in excess of one million 
active items requiring files in excess of one 
billion bytes are considered large for this 
purpose. The project should include; , 
. the study of use patterns and user behavior 
' in subject access at different types of 
institutions, 
. collection-purging criteria and phenomena, 
• consideration of a user- (or use) created ' 
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C'4.:3ss-ref erence system, 
i theoretical characterization and analysis, 

• synthesis into a system design with a^'o^lyv/iB 
of its dynamics and economics, 

. application to one or more demonstration 

projects in limited areas*. 
The results should have implications for: 
. design cf- future large systems, 

• decisions on selective conversion of back 
files. to new systems, 

• establishment of subject access for audio— ^ 
visual and machine7readable files not now 
satisfactorily cataloged and indexed for 
subject retrieval, 

• adaptation to end user needs. 

Gaps in subject-access tools for the general population 
Problem ; How to bridge gaps in the availability of 

J subject-access mechanisms to serve special 
needs, in school and public libraries. Great 
diversity of need arises from the broad range 
of - age levels^ socio-economic environment, 
la'nguage, and the- special needs of the 
handicapped. Yet the extent and nature of 
these needs are not well understood by much of 
the information profession. 
Action; Make and dissemin^ate an analysis (description 

^and evaluation) of the availability and 
suitability ( usefulness/ effectiveness for the 
audienctf) of subject-access approaches and 
tools for users of school and public libraries 
for all purposes (e.g. to include at least 
^ educational/ personal/ and community activities 

purposes). The study should include 

determining the "how" and the "why" of uses by 
each population segment . 
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Subject access for> the humanities 

Problem ;. How to respond to the general perception of 

- those directly involved " that subject-access 
tools for the humanities are incomplete and 

■ lacking in. availability. Although „ certain 

' ' quality ^ tools ■ do4 exist (e.g. 

Philosopher's IndeV , Arts-&-Hu manities Cita- 
tion Index ) , there-' are many unfulfilled needs, 
and methods used in other disciplines of knowl- 
edge are not as extensively applied in the 
Humanities. 

Actions: a. Conduct research- to determine why - this 
situation- exists that will study these 
questions : 

.What is the nature of the humanities 

literature? . 0> 

. How do humanists use information? 
. What design of information tools is 
appropriate to respond to the characteristicp 
of the literature of the humanities? 
b. Set up orientation courses and follow-up task 
groups for humanists to review information 
techniques used in other fields, to critically 
evaluate them, 'and to explore how to apply the 
techniques, that are judged to be relevant to 
their needs. 

5. Subject access to monographs 

Problem : How to deal with the assertion that detailed 

subject access, to' monographs is inadequate via 
subject headings in card (ot equivalent) 
catalogs and that other tools /or that purpose. 
. generally do not exist. Recent studies show 
that more detailed subject access to monographs 
is potentially useful, at least for non-fiction 
works. Ideally, the analysis and derivation of 
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in-depth subject cataloging and indexing 
material should be done at the source .-(by 
author or, publisher), but /there. is no economic 
incentive to do so; A, separate' 

analysis/indexing process in the- manner .of 
treatment of journal articled by abstracting 
and indexing organizations is uneconomic due to 
the time required for a subject expert to read 
the monograph and\ perform the analysis . The 
tables of contents and back-of-the-book indexes, 
in many monographs are not exploited for this, 
purpose. In addition to ' general' information 
retrieval purposes, good subject access to 
monographs would enhance the effectiveness of 
blanket order plans with publishers by more 
specific screening to fit the needs of, specific 
libraries. Many technical ideas that have been 
suggested at various times have fpiiled for 
economic reasons. 

The workshop generated no suggestions that 
•would have widespread applicability for making 
significant headway on this problem. The 
situation remains an important unsolved problem 
to be /faced by publishers,- abstracting and 
indexing activities , and libraries, singly or' 
together. 

Without prejudging the outcome and without 
suggesting that the method would solve the 
entire problem, the following research project 
might lead^ to useful results for some future 
purposes . Study ^-the generation of useful index 
material by statistical and/or linguistic 
analysis of the text of monographs available in 
machine-readable form as a by-product of 
^computer-controlled con^osition. - The condi- 
tions that lead to this recommendation ar*e: 
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. the increasing use computers by publirhers, 
• the capacity and speed of computers are 

rising and costs are falling dramatically, 
. something useful should be better than 

nothing, as is the situation now; traditional 

subject headings or index entries need not be 

required. 

6p Subject-access authority files 

Problem : How to cope with- the creation of a national 

library network which requires that subject- 
access mechanisms be ^established to work in 
that environment and to provide multi- 
inistitutional access. One key to the effective 
functions of such a system is an appropriate 
i subject authority system of broad 

applicability . 

Action: a. Include in the work toward an authority system 
for the Bibliographic Service Development 
Program means to link subject-access vocabulary 
authorities of both the library and the- A&I 
communities and use the participation of both 
communities in developing that linkage, 
b. As a related effort, assemble a work group to 
study the possibility of defining codes (as in 
"cataloging code" or "indexing rules" ) to guide 
the construction and use of each subject-access 
vocabulary included, in the authority system. 
Work guides used in various subject areas by 
the corresponding cataloging or indexing 
agencies should provide much guidance for this 
work. 
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cno-rw^T J^CCESS IN P.ERSgE CTIVE 
Professor F. Wilfrid Lancaster 
October 18, 1978 



T nohired wh-n I began to prepare for 
Than, you ve.y .uch. I .ot^c., «^^ ^^^^^^^ ^^^^ 

this talk that its acronym is SAP. I am 

si.nUlcant or not. But In my ^-""^^ J^^,,^^ 

r ::::r :r:::^r. tiim. a.out th. 

subject access ' , hat I did approximately ten 

problems Z was --f/^;, „ ° «.aonal Library o. Medicine 

years ago When I was working 

a -aged^n an e.alu tion ^o.^^^^^ ^^^^^^^^ 
library. This approximately 

performance °' ^ '"/^ undertook for their 

300, that the ^National Library o ^^^^.^^^^ ^^^^ 

users. AS you can imagin. .o.^ of th^ ^ ^ ^^^^^^ 

completely , , „,u ever forget them. 

all-time favorites ^ ^ ^ ,,ese with you. 

I would likfe to begin this t.ilK oy ^ - 

= .he searches conducted was ' on the subject or 
Tion anx ety T c^lldhoc., the objective being to retrieve 

separation anxiety ,„^i^tv of a child at being separated 

,„formation --"^^ , " Ltely Irrelevant articles that 

from Its parents One of ^ delightful title of 

was retrieved by this 

■psychological Aspects ^--ision^^^^^ ^^^^^^^ ^^^^^ . 

same group was on ^^^J^^^^ premature rupture of the fetal 
membranes. The search P document entitled 

membranes retrieved a -"P^^^^'^'""' " ^. £,cts for two 

J .- ^4-" Now I mention these cwu 
..project Headstart . , Now ^^^^^^ 

p^r-Qh of all I could not tninK 

reasons. First ot aix u^^^use these examples are 

' \ • i^alk and secondly, because cne:= 

o. t^: fUt that even with highly tramed mdexers. a 



carefully controlled vocabulary, and experienced searchers , 
information retrieval systems do not always retrieve what we want 
or expect them to retrieve, although for certain purposes they 
may retrieve something better, 

I think everybody in this room would be willing to agree 
that we have seen very great improvements in methods of subject 
access since World War II. In this period there have been, I 
believe, four major developments. The first was the development 
of the first .so-called coordinate or pos tcoordinate indexing 
systems, which allowed us to achieve for the first time freedom 
from the linear sequencing of subject terms that was 
characteristic of card catalogs and printed indexes; and indeed 
to achieve for the first time a true multi-dimensional approach 
to subject matter efficiently and econorically . Later, we^ saw 
the progressively more sophisticated implementation of these 
postcoordinate systems through the principles of optical 
coincidence, edge-notched cards and microfilm; later, punched 
cards; and somewhat later still, through magnetic tape and 
magnetic disc. Thirdly, we've seen a quite spectacular growth in 
the number of machine- readable databases and data banks available 
for the provision of information ^s-ervices . These have grown from 
one to several hundred in approximately a fifteen year period. 
Fourthly, we have seen an equally rapid growth of ^computing and 
telecommunications facilities for making these resources 
accessible widely and inexpensively online. 

The period since the Rapid Selector and since the work of 
Batten in England and Calvin Mooers in the United St ates in the 
1940 's has been one of rapid change in methods of subject access. 
It has been a period in which wheels have been discovered, 
rediscovered, forgotten and discovered again. It's a period in 
which we have gone from controlled vocabularies, to natural 
language, to controlled vocabularies and back again to natural 
language. It's a period in which we've gone from systems, namely 
card catalogs and printed indexes, designed to be used by more or 
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less anyone, to more esoteric systems designed only to be used by 
information specialists. Now we are trying to rediscover how to 
design effective "anyone systems" again. What have we learned in 
this period? What subject access truths have emerged? What 
knowledge has been gained to guide information systems design in 
the future? What are the gaps in our knowledge? What are the 
neglected areas of research? Let me mention some of what I feel 
to be the major things that we've discovered in the last few 
years . 

Subject access systems are complex organisms. A succession 
of evaluation studies has revealed that there are indeed many 
factors which determine the success or failure of a search in 
such a system. These factors include: the coverage of the 
database; the indexing policies of the database producer; the 
quality and consistency of indexing in that database; the 
effectiveness of the interaction between the users and the 
system; the completeness of the searching strategies; and the 
characteristics of the vocabulary used in the indexinc, and 
searching operations. 

One of the problems, and in some sense the major problem, 
faced in the provision of information service, is that most of 
these factors are not directly under the control of most 
information c nters. In fact, in a typical information service 
situation the information center controls only a few of the 
factors governing its own performance. Consider, for. example, an 
industrial library searching machine-readable databases on behalf 
of its engineer and scientist users through some online service 
center — I'll use Lockheed as an example in deference to Charlie 
Bourne — through Lockheed. The library itself controls 

selection of the database or databases it is going to search, but 
the staff of the local library has no control whatsoever over the 
quality of the database it uses. it has no control over its 
coverage; it has no control over its indexing policies; it has no 
control over the accuracy of the indexing; and it has no control 
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over the vocabulary used to represent the subject matter/ It has 
some control over its own search strategies but not even complete 
control over these because the search strategies must be 
constrained by the characteristics of the database itself and by 
what the system software^ the query language in this case, allows 
the searcher to do. The local information center has most 
control over the methods by which it interacts with its own 
users. But even here the control is not absolute since it is 
impossible to exercise complete control over users and their 
behavior. You can lead a horse to water but you can't make him 
(or her) drink. 

Another thing that has emerged rather clearly in the last 
few years is that it is rather difficult^ in fact extremely 
difficult, to evaluate the performance of subject access systems. 
The results of a known item search can be judged as either 
successful or not. But the results of a subject search cannot be 
evaluated on a simple binary scale. They must be assessed on 
some form of graduated scale which takes into account how much of 
what is wanted is retrieved and how much of what is retrieved is 
wanted. Such a scale implies the use of values, such as the 
values of relevance or pertinence, which tend to be subjective, 
transient and relative rather than objective, stable, and 
absolute. 

Another thing that I believe we've discovered is that there 
is. no single correct approach to the design and implementation of 
information retrieval systems. A system capable of performing at 
some specified level can be implemented in several possible ways . 
In this connection, it is important to recognize the obvious 
trade-off between input and output costs in the operation of 
information services. We can choose to deliberately reduce the 
input costs or effort, for example by adopting some minimal level 
of " indexing such as keyword-in-context , ' but only at the expense 
of increased cost and effort at the time of the search for the 
information. • This leads me to the related matter of 
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standardization. Efficient access to documents by author or 
title requires standardization. The fact that author/title 
cataloging can be reduced to standard rules equally applicable in 
all libraries means that these activities can be and should be 
handled in a centralized or cooperative mode. But we must be 
very careful when we talk about standardization applied to 
subject access. 

In the notes prepared by the organizers of this meeting an 
important question* is raised as follows: "How can we :riiminate 
expensive and time-consuming duplication of intellectual 
processes in creating subj ect access to bibliographic entities? " 
My answer -to this is that I do not believe we can, at least not 
by using any conventional approach to the problemc Subject 
indexing and subject cataloging are not susceptible to 
standardization and centralization.. Indeed, standardization and 
centralization ' may be • regarded as the natural enemies of 
effective subject access. 

, ^Effective i^iifaject indexing involves the establishing of 
answers to two quest icns . The first question. What, is this 
document about? The st^cond questicii. Why are we in;:tereited i 
it? Question number one may be handled in some • standardized, 
centralized way. Question number two cannot by any stretch of 
the imagination be - centralized or i^tandardized. Ten different 
organizations may all agiree ;>n what a particular document is 
about, but each may have completely df.fferent interests in this 
document and completely different reasons for adding it to their 
collections. They have, theiref ore , Completely different needs 
for subject access to this dc-ument. We should not be disturbed 
or even surprised to find that the same docuiaent has been indexed 
in ten different ways, in ten different information centers. 
Indeed the reverse is true. It would be cause for surprise and 
for conce>rn . if we did find the identical indexing of . this 
document in all t.en organizations. 

Another point, we ^can recognize over the . years a move to 
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simplification in the design and implementation of information 
. retrieval systems. The systems of today, although implemented on 
more sophisticated equipment , are in many ways simpler than many 
in use in the 1960's. Many of the earlier systems were over- 
designed, incorporating, for example, links, roles, relational 
indicators, semantic codes and other devices to filter out noise 
or to achieve more complete search results. These refinements 
could rarely be justified from a cost-effectiveness viewpoint. 
They gave marginal improvements in performance and these marginal 
improvements were not enough .to 'offset the greatly increased 
.costs at input, 

f ' ' • : 

Information retrieval now seems to be looked at much more 
pragmatically than it was twenty, years ago. There seems now to 
be more realism in our approach to information retrieval 
problems. Writers twenty years ago were very much concerned with 
identifying possible semantic or syntactic problems^t^fech might 
affect the retrieval process , Most of the'problems that they 
unearthed were more theoretical than practical. For .e^xample, is 
it likely, that anyone will be looking for information on blind 
Venetians? If so, is it likely that information on blind 
Venetians will appear in the same database as information on 
Venetian blinds. And if it does, how; much effort is needed to 
segregate the blind Venetians from the Venetian blinds after the 
search is completed^? 

>It is sometimes overlooked that words that are quite 
ambiguous in isolation may be quite unambiguous within the 
context of other words. Information retrieval is rarely 
concerned with words in isolation^ 

Finally, I think, in terms of our discoveries over the last 
few years, it has been recognized that it may be possible to 
provide effective subject access through approaches that are 
somewhat uaconventional, including approaches that may be 
partially or completely independent of the vagaries of language. 
These approaches include the approaches of citation indexing, 
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bibliographic coupling, and some form of pattern matching whereby 
we can ask a retrieval system to find other documents that are in 
some sense like a document we already know to be relevant to our 
interests. 

We have learned a lot about subject access since the 1940s. 
While we have not found , all the answers, at least we may have 
identified some of the more important questions. However, much 
still remains to be learned. There are many gaps in our 
knowledge and many research areas have been neglected. For 
example, the most critical aspect of an information service is 
that, of the interface between the information service and its 
users. When I talk about the user system interface I do not 
necessarily mean man/machine interaction. In fact, I am more 
likely to' mean person to person interaction: the interface 
between a user with some information need and a member of some 
information staff who will attempt to satisfy this need. I refer 
to this as a critical area since it is here that a latent n^^ed is 
transformed into an expressed need, that is, a request for 
service from the information center. . Clearly, the whole 
information retrieval operation, depends upon the success of this 
transformation, that is, the extent to which the expressed need 
accurately reflects the actual need. We may have a system in 
which documents are indexed exhaustively, the quality of the 
indexing is carefully controlled, the vocabulary is carefully 
constructed and sufficiently specific, and the search strategies 
are comprehensive ; but all of this matters "little^ if in fact we 
are looking for information. our users neither need nor want. 

The user/system interface is the most critical component of 
an information retrieval system. It is also in many ways the 
most neglected. We still do not know for sure what -methods of 
interaction are most effective in capturing requests that are 
accurate representations of the needs that underlie them. It is 
at the user/system interface that most operating information 
services can achieve the most significant improvements in their 
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own performance. 

We do not. know a great deal either about the language with 
which users approach subject catalogs and subject indexes. From 
where do they draw their search terms? What are the factors that 
i.nfluence the user's choice of entry terms? Symptomatic of 
neglect "in this area is the fact that most computer-based systems 
"have trac^itionally maintained records of the frequency with which 
terms are used in indexing; but how many have routinely 
maintained records of the' frequency with which terms are used in 
searching? 

In the 1960's considerable experience . was accumulated in the 
design of computerbased retrieval systems. But most of this 
experience was experience related to the design of ;systems to be 
used by information specialists. Now we are faced with the 
design of systems to be used in a non-delegated mode by chemists> 
physicists, economists, and other professional practitioners. 
This is a completely different kettle of fish. Should we design 
such systems so that the workings of these systems, rather like 
the. automatic transmission in an automobile, are completely 
transparent to the user, or should we design them with sensitive 
and high performafice gear boxes which are entirely under user 
control? Is it reasonable to expect the infrequent system user 
to learn the nuances or, I would even say, idiosyncracies ^of a 
controlled vocabulary? 'Or is it more reasonable to design the 
system in such a way that it can .be interrogated in the user's 
own uncontrolled vocabulary — the natural language of his subject 
field? ■ 

What of the future of subject access? The changes of the 
next forty years will surely be more dramatic than those of the 
past forty. The entire structure of the provision of information 
services, especially the economics of these services, is 
changing. Indeed the entire structure of formal and informal 
communication may be changing. We no longer need to think 6t a 
major reference library as a large collection of print ent:losed 
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within four wall. We no longer need to make a capital investment 
in. a database to make it physically accessible, that is, we no 
longer" need to have it sitting on the shelves of a library. 
Information services are moving rapidly into an. on--demand, pay- 
as-you-go environment; an environment in which one can purchase 
access to whatever information sources are needed at the time 
th^y are needed. Th is situation greatly improves the 

accessibility of information sources, since a database needed 
once a year is as accessible physically, if not intellectually, 
as one needed a hundred times a year. 

So far most of the use of online services has been use in a 
delegated or mediated mode. This will undo'ubtedly change as 
terminals fall in price and, as. a result , become increasingly 
available. The problems of subject access in the future then are 
likely to. be related to the design and implementation of user 
oriented systems. Systems whereby the practitioner in some 
field, who is not an information specialist, may use a terminal 
to access an almost unlimited array of information sources.. What 
are the problems of operation in this type of environment? 

The first problem may simply be that of an embarras de 
richesses . So much will be accessible through a single terminal 
that the user will be overwhelmed with the resources available. 
The first requirement will be for some form of automatic referral 
mechanism whereby a statement of the user's need in natural 
language form can be matched against some online directory of 
available resources. .The 'result of the match will be an 
identification of those^ sources which have the highest 
probability of satisfying this need together with an indication 
of how these sources can be accessed and perhaps how much it 
gosts to access them. In this environment in which many 
different databases may be accessible from a single terminal, 
much use of databases will be made by the infrequent rather than 
the regular user. This may mean a continuing move toward natural 
language and away from controlled vocabularies, at least 
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controlled vocabularies in the conventional sense, since natural 
language systeips are generally more hospitable to the infrequent 
user and since a query in natural language form can readily be 
transferred from one natural language database to another. 

Does this then imply a declining interest in thesauri and " 
vocabulary construction in general?" Not necessarily so. A 
controlled vocabulary, as you know/ has two major purposes. It 
exists to normalize tire terminology of a subject field..and it 
-exists to provide a structure whereby terms whose meanings are 
related are brought together in 'some way. It has generall^been 
assumed that these purposes must be served by a precontrolled 
vocabulary, a vocabulary in which this normalization and 
structure is imposed at the time of indexing. But we are not 
necessarily restricted to precontrolled vocabularies. There is 
no reason why we cannot develop postcontrolled vocabularies-r 
vocabularies that' are tools only of the searcher of natural 
language- databases; tools such as synonym tables and/or loosely 
structured thesaurus groups that are constructed by humans, by 
computers, or by a combination of man and machine. The future, I 
believe, lies in this direction. Through the use of natural 
language input, and some type of postcontrolled vocabulary -at 
output we can combine at least some of the advantages of both 
into a single system. 

in conclusion, in your deliberations . during the next two 
days, I would urge that you look at subject access' in terms not 
of the past and not even so much of the present, but in. terms of 
the needs of subject access in a future electronic, on-demand, 
pay-as-you-go society. Thank you very much. 
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SUBJECT ACCESS; iCHALLENGES FOR THE FUTURE 



Margaret T. Fischer 
^ October 19 , 1978 

• • '-^ • . ^ 

In 'college a very special professor was trying to instill 
the importance of good organization, dynamics, arid guick 
thinking. It was at the end of a writing course when he said to 
us, "I'm going to give you . four things to write about in foUy 
minutes. Tell me about mystery, royalty, life and religion." 
Not many people, did much for the first few minutes. Suddenly a 
student got up, went forward and handed & piece of paper to the 
professor. He started to walk out but the professor said, "Wait 
you've got to try." The student said, "I d^d. You told us you 
wanted organization, you wanted us to do it quickly, and" you 
wanted dynamics.-" The professor picked up the- paper and read: 
"My God, said the princess, I think I'm pregnant. I wonder who 
done it." Here are the words - not in isolation as Wilfrid 
Lancaster said last night. They are curt, they are clear, and 
they are concise. I would hate to index this little scenario and 
there would be delightfully unexpected smiles at ret'ieval. But 
it illustrates the heart of our problem — words. 

This simple example stresses the information process as 
being, part 6^ ■ the process of discovery and of innovation. It 
asserts that the essence of the information problem is to 
maintain knowledge- as a viable unity and that the basic 
information processes are those of selecting, of reviewing, and 
of synthesizing information. Perhaps we should think no more 
..seriously about a fully automated information system than we 
would Ujink of a time and discovery machine, but instead to work 
toward bringing collective wisdom to bear on the • everyday 
management of information using as much relevant technology as 
possible, yet mindful th^t our system will be built in levels,, 



ERIC 



some more beautifully conceited than others, rathej: like a latter 
day hanging garden of Babylon. 

VJhere ^o we begin with the Alpha and the Omega? ^*'What. is 
this all about?^ Why are we interested in it? Since every 
collections manager will answer differently we said last night, 
let the ' publisher in concert with the author begin to give tis the. 
, a'^l^ers of th*e Alpha "in which our hero does so and so," said 
David^ Batty; ' "patchwork quilt," said Lucille Gordon; "all for 
beitt.pr informa-tion; " "if that would speed delivery I'll do it;" 
Lnd "I'd rather work with book in hand." Being a publisher I'd 
rather see us 'begin with publishers. What better place to know 
what this information is, what it is all about,, and why we are 
interested in iti Publication has been extended far beyond pur 
presejnt ability to make real use of the record, so l et the record 
t hrough the publisher ' beg in ' to make use of itself* 

Our first challenge, it would seem, is to work out a way in 
harmony with publishers, not only publishers of the origiaal 
materials but publishers of the surrogate materials. They are 
sellers purveyors of information. We have heard some of them 
speak during this last day and a half. They must make money in 
brder to surv^'^ , They are as interested as we in providing the 
keys that wil J allow others to understand the value of. the 
information which they purvey. I firmly believe that we can work 
with publishers as they develop the means of selling their 
information and at the same time begin to index it. The Alpha. 
We've got to go on to say, as my parents taught me for years, God 
give me the grace to accept what I cannot change and to change 
what I cannot accept. What we must accept are some of the 
truisms that we have been speaking to each other about. 
"Inconsistency," we say as if it were a bad word. . "Inconsistency 
in indexing-." So be itl I will accept the Tefko Saracevices and 
the other researchers who tell us by their wisdom and the dint of 
their 'effort that indexing is inconsistent. Perhaps we should 
say consistency is the hobgoblin of big minds. And so let us 



acc pt . onsxstency. user - „e a.e so conoe.nea about the 

The u".^ " '-^^'"^ intermediate use., 

w e.eveYTat : "^er- . ^whoever see.s Indorsation, 

cZ T u — what we 

cannot change and get on. Kjth changing. . 

There are times „h^' publishers ■ Index or abstract for us 
yet none of us ever us/ the work Po u 
has ^h.. For years, the Mew York T imes 

has had the essence of the news abstracted on theT irst page of 
the second s^tion. The Wal^streeJ^a. rnal prints L key 
stor.es .„ capsule form ..,n^Ii;n;;^,,,^J '^J^J^ 

HLV" ^" — - know 

use o th:: T"""- ^ consciously makes 

use Of these abstracts, -including people who work at the «e„ York 
Txmes oata Bank and the Oow-aones Information System. Tet us 
open our eyes and look and see what is there. 

Pictures - I worked for twelve years for LIFE Magazine - 

G r'ie": T "° ^"^-"^ ^"5-- 

What a " " '° '^-^-^ 

What Gene saw an his mind's eye were pictures of contents pages 

Ithough e "Photographed, them in a different form. But L; 
back to thinking about snapshots. Bottom-of-the-line. television 
, ^-P--- the very cheapest set with the worst resolution. an 
transmit sixteen reasonably good pictures a second.- The record 
I- made by a moving beam of electrons sweeping across a picture 
whll ' momentarily 

Th nk if ^^r'""^ ^"- -n absorb that and more! 

Think If, for example, we took a picture of the cover of the 
contents page of a monograph, and of what we were going toindex. 
It IS not very costly because televi-sion is very inexpensive. 
Sxeen .ames every second; we can look at them almost 
subliminally or we can look at thim more slowly because we ■ 
already have: the technology to monitor at a slower scan rate. 
What I am getting at is that the picture is finished as soon as . 
It xs taken. The Polaroid people have been telling us this for 
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more than twenty years. Let us, as we ponder, think of the 
myriad of ways in which the picture is finished as soon as it is 
taken and think of that picture as information. This means we 
should try to h ave access assured at the point of publication in 
anx^QSJLL-^-n^ there are many, many forms. 

The Omega — All af.ternoon I listened; not once did anyone 
say, "information." Why are we in business? To provide 
information on all levels, in all decks, to all kinds of people. 
That means that we must try to understand' the roots by which 
information comes to us. I suspect we know somehow that facts 
are easier to deal with than ideas, that entertainment as in the 
Gothic novel is easier than knowledge, and that static, 
historical information is easier than volatile, everchanging 
information. But information — not indexing, not cataloging, 
not faceting information and the means of getting at it is our 
business. I contend that people know how to get information when 
they need it badly enough regairdless of what the systeai is. If 
the system is really bad, they will not use it. They will find 
their own way even if it is through the "College of Colleagues." 
We will get what we need when we need it some way» Perhaps above 
all we should remember that. 

If we are going to get information, the most important thing 
to seek is answers. Certainly we all know that answers have to 
do with postcontrolled vocabulary, with the vocabulary of output, 
and with online answering. In the business of analyzing answers, 
we must of course know the grand question, the save search 
strategy, the number of words used in searching, the natural 
language input, and the library of search strategies — all these 
things we have been talking to each other about for this day and 
a half. It is the question that produces the answer and the 
answer that manufactures the question. They ar^f one and the 
same. - 

I once had an experience with the business of questions and 
answers. We collected and analyzed 15,000 questions, most of 
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which came in via the telephone. We learned two essential things 
because we were able to put these questions to machine-readable 
files of abstracts, text, reports, and a vast array of junk, as 
Ed Papenfuse would call it. Sometimes like the girl with the 
curl we did very, very well, and sometimes we did horribly we 
learned first that .by and large the question s negotiated 
I because the asker does not know what he wants. The question is 
usually the beginning of a question as for example, "i want to 
know all about China." The user does not want to know all about 
China because we would have tc back up a truck and fill it. 
"Well, now, I do not really war.t to know all about China, I want 
to know about" - and the dialogue begins. r takes a long time 
before the dialogue ends. Thus we talk a.'cut the problems of 
search strategy; the problems in classification, cataloging, and 
indexing. Number two is another of the givens in which we- ask 
God to give us the grace to accept what we cannot change, namelj., 
that there must be different schemes, schedules, classifications 
and levels for different, needs . i n,ay be very wrong but I cannot 
see how we can serve a world of users with one way. The way to 
develop the various schemes is to know the market. Not too many 
of us here concentrate on markets but publishers certainly do. 
You've got to know your potential user. if you want to expand 
your businc^ss you must know to whom you are expanding it. What 
are their questions? what are their answers? what are their 
needs? Who are they? where are Uhey? Knowing the mark-et is 
knowing how to devise your product. It also tells you something 
about the kinds of answers you should provide. Some say they 
want real answers — not pointer- ,:o answers. They want data- — 
not bibliographic citations. Tn want text - not abstracts. 
They want abstracts — not expanaed titles. But the truth is 
that sometimes each of these things serves its purpose to get at 
the information. If what i want is a book ..n the renaissance, i 
really do want to use a card catalog. The catalog gives me the 
route to my answer. And so let us not throw out tnese various 
ways of getting at information. 
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For\ example, Dow-Jones has a new information service 
fashioned for people who only want three months worth of data 
because the users are interested in current information about 
business; about the stock market; about stocks and bonds; about 
people in business; and most of all about corporations and 
mergers and acquisitions. This information . is so highly volatile, 
that DOW is not going to go back retrospectively. They are 
carving out a special market for current information. 

The people at the New York Times say, "They're crazy, the 
only way to go with news information is retrospectively." The 
New York Times is penetrating another market. 

In the beginning, Dow employees that they did not need full 
text so they cut stories at the bottom and excluded some short 
stories. DOW soon discovered that its customers demanded every 
single word that has appeared in current Barrons and the 
Wall Street Journal . Users want every word because sometimes 
every word counts. Full text is required. Conversely, people at 
the New York Times say seventy to eighty percent of the answers 
that their customers need are included in the abstracts. Ergo 
there is no universal way to offer news to the marketplace. 

Now we all know .that a record, if it is to be useful, must 
be continuously extended. It must be stored in a way that it can 
be found and above all it must be consulted sometime or another. 
If we are going to extend records or change records, we must also 
"deep six" them or the seas of information ,will swell beyond the 
limits of good fishing. This is something else we have not 
really ccme to grips with — how to "deep six" a record. We are 
fearful. Mendel's concept of the laws of genetics was lost to 
the world for a whole generation because his publication did not 
reach the few people who were capable of 9. ssping it and 
t^xtending it. We fear that this sort of catastrophe will be 
repeated; that truly significant attainments will become lost 
either in the mass of the inconsequential or in the mass of What 
we do not keep. And yet, if we are going to serve our market, we 
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must learn not only how to select but how to reject later. 

Creative thought or volatile information, and repetitive 
thought or static', historical information are really very 
different things. For the latter, there are and may be powerful 
electronic aids. , The repetitive processes of thought/ like 
repetitive uses of information, can be regulated and -defined like 
matters of arithmetic,, statistics and tables. In fact, every 
time one combines and records facts* in accordance with 
established logical processes, the creative aspect of thinking is 
concerned only with the selection of the data and the process to 
be employed : — meaning, "which one do I want?" But subject 
recognition and manipulation thereafter are repetitive in nature 
and hence a fit matter for us to think about relegating to the 
machine. This business brings us to the thinking that subject 
access should possibly present subject recognition in the sense 
that we had pattern recognition developed by machines. It begs 
us to look for machines and their programs for advanced analysis. 
True, the users of advanced manipulation and advanced methods of 
manipulating data are very small in number indeed. But some very 
exciting , things are happening. There are machines for 

forecasting and report generation which take information about an 
industry or specific information about a corporation to play 
"what if" games, or to compare the company's performance against 
the national norm. There are machines for solving differential 
equations, functional and integral equations, and for that matter 
there are machines like the harmonic synthesizer which predict 
the tides. 

Now I am not hung up on machines. What I am hung up on is 
how we can possibly think about using our own minds and the 
capability of the machine to do something that we have not been 
able to do before. Within the last ten years a whole new group 
of people has devised a kind of middle man of all get out. We 
call them the purveyors of numerical databases, a misnomer. Some 
of these databases are numeri cal in content , others af e mainly 
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^ substantive and some are mixed, but all can be called information 
databases because all the information is there. You need go no 
further. .what these purveyors have done with no bibliographic 
control whatsoever is to provide subject access to industries by 
recognizing two things: 1) they know how the people in that 
industry think and the words that they use in conducting their 
business so they use these words in searching; and 2) they know 
what data these corporations need in order to conduct their daily 
business life. 

I ^am going to tell you. about one purveyor that is only five 
years old — not necessarily the most significant and certainly 
not. the -one that makes the most money at this point. This 
company is the epitome of what this new information business is 
about. Marine Management Systems is located in Stamford, 
Connecticut. Their system is called MARDATA. Five years ago the 
principal officers said to themselves, "We know the shipping 
business. Most of the information that shippers need sits in 
Lloyds of London, the Journal of Commerce and Shipping , charter 
brokers; the Tanker Advisory Center, and insurance companies that 
insure the ships that go about carrying the world's cargo. The 
world's cargo resides primarily in 30 corporations." Thirty 
corporations — just the way 180 boc publishers provide almost 
all the books that we ^Americans read — thirty corporations 
provide virtually all the shipping of the world. So MMS set out 
to get these 30 corporations by providing them with all the 
computing facilities and databases which the maritime industry 
needs to move ships about, to man them, to give them food, to 
decide which ship is better equipped to carry cotton from 
Alexandria through the Canal and Red Sea to Sri' Lanka or 
wherever. MMS developed the programs to do financial reporting, 
forecasting, planning, as well as operations. The programs 
include voyage analysis and reporting, economics and planning, 
vessel performance, transportation and cost analysis, . cargo 
booking, terminal control, and general accounting. The users are 
the top .executives in the shipping corporations. This is just ■ 
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one .example of a highly dynamic business being performed by 
people who never heard of catalog ing , of index? ng, of , subject 
access, or of bibliographic control. Information for industr^' is 
being done now in advertising, in banking, in fina.^cial 
communities — and we can and should learn lessons from this. 

Relief must be secured from laborious, detailed cataloging, 
indexing and classifying even though surely we will be doing some 
.of this for a long, long time. If the users are to be free to 
use their brains for something more than repetitive, detailed 
transformation, we must get on with our' business . One might say 
that a mathematician, or a financier or a cataloger is not a 
person that can readily manipulate figures or words. How come? 
This is . not even a person who can readily perform a 
transformation of equations for the use of calculus, or the 
transformation of words for the use of indexes. This person is 
primarily an individual who is skilled in the use of symbolic 
logic on a high plane. And especially, this is a person of 
intuitive judgment in the choice of manipulative processes that 
arv^ employed. All else, this person should be able to turn over 
to some organization or some mechanism just as confidently as one 
turns over the propelling of a car to the intricate mechanism 
under the hood, ) 

Whenever logical processes of thought are employed, that is, 
whenever thought for a time runs along an accepted groove, there 
is opportunity for better organization whether it be a catalog, 
an index, or a thesaurus. And there is opportunity to utilize 
technology to help drive it. Put a set of premises into such a 
device and an organization, turn the crank and it will pass from 
conclusion to conclusion. This is a much larger matter than 
merely* the extraction of data for the purpose of research or 
seeking information. It involves the entire process by which man 
profits by his inheritance of acquired knowledge. The prime 
t-^ction of use is selection and here we are halting indeed. Our 
ineptitude in getting at the record is caused largely by the 



Appendix B 



10 



artificiality of so many of the systems that we use for indexing, 
cataloging and fishing. We call it serendipity. ' 

We place data in storage alphabetically or numerically and 
information is found by tracing it down from subclass to 
subclass. Information is., generally in one place unless 

duplicates are used; and one has to have rules a°s to which path 
will locate it. ' The rules are cumbersome. We have talked all 
day about how cumbersome the rules are. Having found one item, 
save search or not, we have to emerge from the system and re- 
enter on a new path. The human mind does not work this way. It 
operates by association — with one item in its grasp it snaps 
instantly to the next that is suggested by the association of 
thoughts in accordance with some intricate web of trails carried 
by the cells of the brain. Trails that are not frequently 
followed are prone to fade. items are not fully permanent. The 
brain does get rid of stuff. Yet the speed of action, the 
intricacy of the trails and the detail of th^ mental pictures are 
awe inspiring beyond all else in nature. We try in our 
classification and in our thesaurus schemes but we have much more 
vertical association. Pretty much only the cross-references with 
which we grope haltingly, give us the horizontal association of 
trails. 

Humans cannot fully hope to duplicate the mental process 
artificially, but they certainly ought to be able to learn from 
it. The first idea to be drawn from tY^e analogy is selection — 
selection by association rather than by indexing with clusters or 
facets.. One cannot hope to equal the flexibility with which the 
mind follows an associative trail. But it should be possible to 
beat the mind decisively in regard to the permanence and the 
clarity d^f the items resurrected from storage. Little-used 
trails will not fade. what we need along with our association is 
a synthesizer. One would hope that we would use the reviews that 
we find in our literature, the commit toe -s for choice and 
selection, to begin the process of synthesizing knowledge. There 
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are two things that we need. . There is no question, probably 
because of Mendel, that we need, to keep what I call "raw data" 
somewhere — the Library of Congress is as good a place as any. 
But the information with which most of us live in the sense of 
management systems — giving data to a specific market for users 
to conduct their business — should be synthesized in order Zor 
it to be used wisely and profitably. This means that we want 
things ready made with a mesh of associative trails running 
through them, ready to be dropped into storage and there 
amplified. The lawyer has his associated opinions and decisions; 
the patent attorney has millions of issued patents on call; the 
physician has the reactions of patients trailed to earlier and 
similar cases coupled with pertinent notes on anatomy and 
biology; the chemist has lists of organic compounds with trails 
to the analogies of compounds and side trails to their physical 
and chemical behavior; and the journalist has biographies of 
people and corporations with a large dower of history thrown in. 

The challenge is to us — we who are ever the professionals 
trail blazing in information — people who find delight in the 
task of establishing useful trails through the enormous mass of 
the common record. I would like to think that the association 
aspect is like that old game we used to play as youngsters — 
animal, vegetable or mineral — very much like "What^s My Line?" 
Remember how quickly and with what enormous facility — using 
information theory/ by cutting things in half — we arrived with 
great speed at the answer. "Is it bigger than a breadbox?" All 
our steps, in creating or absorbing material of record, are 
perceived through one of our senses — the tactile when we touch 
keys,*' the oral when we speak or listen, the visual when we read. 
Let us learn to use better the atificial senses of multi-media 
for subject access. 

Humans have built a civilization so complex that^ we also 
need to organize records more fully if civilization is t-o-advance 
to a logical conclusion and not bog down by overtaxing a man's 
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limited memory.. Never has there been a statue erected to one who 
let well enough alone! 

I Now what is this path? It is the path of Aristotle. It is 
the path of Diderot. It is the path of Bacon. And most recently 
it is the path spelled out for us by H. G. Wells. I would like 
to read to you from a' paper he wrote on November 20th, 1936, 
^iv>2n^^at the Royal Institute of Great Britain at their weekly 
evening meeting. This is just a tiny, final portion of it. He 
speaks of the collective wisdom and knowledge of mankind and our 
need to synthesize it:. Wells cal^s it a world encyclopedia. He 
does not n:ean the Britannica or anything you and I know. Listen 
to his words. 



"This world encyclopedia would be the mental background 
of every intell igent\ man in the world. It would be 
alive and growing a\iid changing^ continualiy under 
revision, extension, and replacement from the original 
thinkers in the world everywhere. Every university and 
research institution should be feeding it. Every fresh 
mind should be brought into contact with its standing 
editorial organization. And on 'tiie^other hand, it's 
contents would be the stands iurce of material for 

the instructional side of schooj. and college work for 
the verification of facts and the testing of statements 
everywhere in the world. Journalists would dain to use 
it. Even newspaper proprietors might be made to 
r<?spect it... Such an encyclopedia would play the, roll 
o: an undogmatic bible to a world culture. It would do 
just what our scattered and disoriented intellectual 
organizations of today fall short of doing. It would 
hold the world together mentally; to hold men's minds 
together in something like a common interpretation of 
reality. There is no hope whatever of anything but an 
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until it Dut<? -iho • o =>/ so It will remain 

AC puts Its mind toaf^f-hci*- ^ 

its s..a„,e eventfuxTsto;; 3 T''^^ 
Cleverest of the ,.eat apes ..e 
Clave. 5ut not .leva. eluVh xt 

-t tMn.s .ut not Us o'n . taTc ■ 

mental contusion." 
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